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(ix) FEATURE : 

(A) NAME /KEY: CDS 

(8) LOCATION : 56. . 3454 

(D) OTHER INFORMATION: /product^ "hTRT " 

/note= "human telomerase reverse 
transcriptase (hTRT) catalytic protein 
component " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC CCGCG ATG 

Met 
1 



CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC CAC 

Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu Arg Ser His 

5 10 15 

TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG GGG CCC 

Tyr Arg Glu Val Leu Pro Leu Ala Thr Phe Val Arg Arg Leu Gly Pro 

20 25 30 



i 



CAG 
Gin 



GGC 
Gly 
35 



TGG CGG 
Trp Arg 



CTG GTG 
Leu Val 



CAG CGC 
Gin Arg 
40 



GGG GAC 
Gly Asp 



CCG GCG 
Pro Ala 
45 



GCT TTC CGC GCG 
Ala Phe Arg Ala 



202 



CTG 
Leu 
50 



GTG 
Val 



GCC CAG 
Ala Gin 



TGC CTG 
Cys Leu 
55 



GTG TGC 
Val Cys 



GTG CCC 
Val Pro 



TGG GAC 
Trp Asp 
60 



GCA CGG 
Ala Arg 



CCG CCC 
Pro Pro 
65 



250 



CCC 
Pro 



GCC 
Ala 



GCC CCC 
Ala Pro 



TCC TTC 
Ser Phe 
70 



CGC CAG 
Arg Gin 



GTG TCC 
Val Ser 
75 



TGC CTG 
Cys Leu 



AAG GAG 
Lys Glu 



CTG GTG 
Leu Val 
80 



298 



GCC 
Ala 



CGA 
Arg 



GTG CTG 
Val Leu 
85 



CAG AGG 
Gin Arg 



CTG TGC 
Leu Cys 



GAG CGC 
Glu Arg 
90 



GGC GCG 
Gly Ala 



AAG AAC 
Lys Asn 
95 



GTG CTG 
Val Leu 



346 



GCC 



TTC 
Phe 



GGC TTC 
Gly Phe 
100 



GCG CTG 
Ala Leu 



CTG GAC 
Leu Asp 
105 



GGG GCC 
Gly Ala 



CGC GGG 
Arg Gly 



GGC CCC 
Gly Pro 
110 



CCC GAG 
Pro Glu 



394 



GCC 
Ala 



TTC 
Phe 
115 



ACC ACC 
Thr Thr 



AGC GTG 
Ser Val 



CGC AGC 
Arg Ser 
120 



TAC CTG 
Tyr Leu 



CCC AAC 
Pro Asn 
125 



ACG GTG 
Thr Val 



ACC GAC 
Thr Asp 



442 



y-.ia 
130 



CTG 
Leu 



CGG GGG 
Ara Gly 



AGC GGG 
Ser Gly 
135 



GCG TGG 
Ala TrD 



GGG CTG 
Gly Leu 



CTG CTG 
Leu Leu 
140 



CGC CGC 
Arg Arg 



GTG GGC 
Val Gly 
145 



490 



GAC 
Aso 



GAC 
Asp 



GTG CTG 
Val Leu 



GTT CAC 
Val His 
150 



CTG CTG 
Leu Leu 



GCA CGC 
Ala Arg 
155 



TGC GCG 
Cys Ala 



CTC TTT 
Leu Phe 



GTG CTG 
Val Leu 
160 



538 



GCT 
Ala 



CCC AGC 
Pro Ser 
165 



TGC GCC 
Cys Ala 



TAC CAG 
Tyr Gin 



GTG TGC 
Val Cys 
170 



GGG CCG 
Gly Pro 



CCG CTG 
Pro Leu 
175 



TAC CAG 
Tyr Gin 



586 



CTC 
Leu 



GGC 
Gly 



GCT GCC 
Ala Ala 
130 



ACT CAG 
Thr Gin 



GCC CGG 
Ala Arg 
18 5 



CCC CCG 
Pro Pro 



CCA CAC 
Pro His 



GCT AGT 
Ala Ser 
190 



GGA CCC 
Gly Pro 



634 



\GG CGT CTG 



.-.rg Arg 
195 



GGA TGC 
Gly Cys 



GAA CGG 
Glu Arg 
200 



GCC TGG 
Ala Tro 



AAC CAT 
Asn His 
205 



AGC GTC 
Ser Val 



AGG GAG 
Arg Glu 



682 



GCC 
A i a 
210 



GGG 
Gly 



GTC CCC 
Val Pro 



CTG GGC 
Leu Gly 
215 



CTG CCA 
Leu Pro 



GCC CCG 
Ala Pro 



GGT GCG 
Gly Ala 
220 



AGG AGG 
Arg Arg 



CGC GGG 
Arg Gly 
225 



730 



GGC 



AGT 
Ser 



GCC AGC 
Ala Ser 



CGA AGT 
Arg Ser 
230 



CTG CCG 
Leu Pro 



TTG CCC 
Leu Pro 
235 



AAG AGG 
Lys Arg 



CCC AGG 
Pro Arg 



CGT GGC 
Arg Gly 
240 



778 



GCT 



GCC 
Ala 



CCT GAG 
Pro Glu 
24 5 



CCG GAG 
Pro Glu 



CGG ACG 
Arg Thr 



CCC GTT 
Pro Val 
250 



GGG CAG 
Gly Gin 



GGG TCC 
Gly Ser 
255 



TGG GCC 
Trp Ala 



826 



CAC 
His 



CCG 
Pro 



GGC AGG 
Gly Arg 
260 



ACG CGT 
Thr Arg 



GGA CCG 
Gly Pro 
265 



AGT GAC 
Ser Asp 



CGT GGT 
Arg Gly 



TTC TGT 
Phe Cys 
270 



GTG GTG 
Val Val 



874 



4 



TCA CCT GCC AGA GCC GCC GAA GAA GCC ACC TCT TTG GAG GGT GCG CTC 92 2 

Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala Leu 
275 280 285 

TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC CAG CAC CAC GCG 97 0 

Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gin His His Ala 
290 295 300 305 

GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC TGG GAC ACG CCT TGT 1018 
Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro Cys 
310 315 320 

CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC CTC TAC TCC TCA GGC GAC 1066 
Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly Asp 
325 330 335 

AAG GAG CAG CTG CGG CCC TCC TTC CTA CTC AGC TCT CTG AGG CCC AGC 1114 
Lys Glu Gin Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro Ser 
340 345 350 

CTG ACT GGC GCT CGG AGG CTC GTG GAG ACC ATC TTT CTG GGT TCC AGG 1162 
Leu Thr Gly Ala Arg Arg Leu Val Glu Thr He Phe Leu Gly Ser Arg 
355 360 365 

CCC TGG ATG CCA GGG ACT CCC CGC AGG TTG CCC CGC CTG CCC CAG CGC 1210 
=-o Tro Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gin Arg 
370 r 375 380 385 

TAC TGG CAA ATG CGG CCC CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG 1258 
7vr Trp Gin Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His Ala 
390 395 400 

CAC TCC CCC TAC GGG GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT 130 6 

Gin. Cys Pro Tyr Giy Val Leu Leu Lys Thr His Cys Pro Leu Arg Ala 
405 ' 4 10 4 15 

CCG GTC ACC CCA GCA GCC GGT GTC TGT GCC CGG GAG AAG CCC CAG GGC 135 4 

Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gin Gly 
420 425 430 

TCT GTG GCG GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC CTG GTG 14 02 

Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu Val 
435 440 445 

CAG CTG CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG 14 50 

Cln Leu Leu Arg Gin His Ser Ser Pro Trp Gin Val Tyr Gly Phe Val 
450 455 460 465 

CGG. GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 14 98 

Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser Arg 
470 475 480 

CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC CTG 154 6 

His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe He Ser Leu 
4 85 4 90 4 95 

GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG ATG AGC 15 94 

Glv Lys His Ala Lys Leu Ser Leu Gin Glu Leu Thr Trp Lys Met Ser 
500 505 510 



GTG CGG 
Val Arg 
515 

CCG GCC 
Pro Ala 
530 

CAC TGG 
His Trp 



GAC TGC GCT 
Asp Cys Ala 



GCA GAG CAC 
Ala Glu His 



CTG 
Leu 



ATG AGT 
Met Ser 
550 



TGG CTG 
Trp Leu 
520 

CGT CTG 
Arg Leu 
535 

GTG TAC 
Val Tyr 



CGC AGG 
Arg Arg 



CGT GAG 
Arg Glu 

Y 

GTC GTC 
Val Val 



AGC 
Ser 



GAG 
Glu 



GAG 
Glu 
555 



CCA GGG GTT GGC 
Pro Gly Val Gly 
525 

ATC CTG GCC AAG 
lie Leu Ala Lys 
540 

CTG CTC AGG TCT 
Leu Leu Arg Ser 



TGT GTT 
Cys Val 



TTC CTG 
Phe Leu 
545 

TTC TTT 
Phe Phe 
560 



1642 



1690 



1738 



TAT GTC 
Tyr Val 



ACG 
Thr 



GAG ACC 
Glu Thr 
565 



ACG TTT 
Thr Phe 



CAA AAG 
Gin Lys 
570 



AAC 
Asn 



AGG CTC 
Arg Leu 



TTT TTC 
Phe Phe 
575 



TAC CGG 
Tyr Arg 



1786 



AAG AGT 
Lys Ser 



GTC 
Val 
580 



TGG AGC 
Trp Ser 



AAG TTG 
Lys Leu 



CAA AGC 
Gin Ser 
585 



ATT 
lie 



GGA ATC 
Gly lie 



AGA CAG 
Arg Gin 
590 



CAC TTG 
His Leu 



1834 



AAG AGG 
Lys Arg 
595 



GTG 
Val 



CAG CTG 
Gin Leu 



CGG GAG 
Arg Glu 
600 



CTG TCG 
Leu Ser 



GAA 
Glu 



GCA GAG 
Ala Glu 
605 



GTC AGG 
Val Arg 



CAG CAT 
Gin His 



1882 



CGG GAA 
■.rg 



GCC AGG CCC 
Ala Arg Pro 



GCC CTG 
Ala Leu 
615 



CTG ACG 
Leu Thr 



TCC 
Ser 



AGA CTC 
Arg Leu 
620 



CGC TTC 

Arg Phe 



ATC CCC 
lie Pro 
625 



1930 



AAG CCT 
Lvs Pro 



GAC GGG CTG 
Asp Gly Leu 
630 



CGG CCG 
Arg Pro 



ATT GTG 
lie Val 



AAC 
Asn 
635 



ATG GAC 
Met Asp 



TAC GTC 
Tyr Val 



GTG GGA 
Val Gly 
640 



1978 



AGA 
Arg 



ACG TTC CGC 
Thr Phe Arg 
64 5 



AGA GAA 
Arg Glu 



AAG AGG 
Lys Arg 
650 



GCC 
Ala 



GAG 
Glu 



CGT 
Arg 



CTC ACC 
Leu Thr 

655 



TCG AGG 
Ser Arc 



2026 



GTG AAG 

'■ ■:: i. Ly'S 



GCA CTG TTC 
Ala Leu Phe 
660 



AGC GTG 
Ser Val 



CTC AAC 
Leu Asn 
665 



TAC 
Tvr 



GAG 
Glu 



CGG 
Arg 



GCG CGG 
Ala Arg 
670 



CGC CCC 
Arg Pro 



2074 



GGC CTC 
Gly Leu 
675 



CTG GGC GCC 
Leu Gly Ala 



TCT GTG 
Ser Val 
680 



CTG GGC 
Leu Gly 



CTG 

Leu 



GAC 
Asp 



GAT 
Asp 
685 



ATC CAC 
lie His 



AGG GCC 
Arg Ala 



2122 



TGG CGC 
Trp Arg 
690 



ACC TTC GTG 
Thr Phe Val 



CTG CGT 
Leu Arg 
695 



GTG CGG 
Val Arg 



GCC 
Ala 



CAG 
Gin 
700 



GAC 
Asp 



CCG CCG 
Pro Pro 



CCT GAG 
Pro Glu 
705 



2170 



CTG TAC 
Leu Tyr 



TTT GTC AAG 
Phe Val Lys 
710 



GTG GAT 
Val Asp 



GTG ACG 
Val Thr 



GGC 
Gly 
715 



GCG 
Ala 



TAC GAC ACC 
Tyr Asp Thr 



ATC CCC 
lie Pro 
720 



2218 



CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 
Gin Asp Arg Leu Thr Glu Vai He Ala Ser lie lie Lys Pro Gin Asn 
725 730 735 



2266 



ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC CAT GGG 
Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gin Lys Ala Ala His Gly 
740 745 750 



2314 



6 



CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC TCT ACC TTG ACA GAC CTC 2 3 62 

His Val Arg Lys Ala Phe Lys Ser His Vai Ser Thr Leu Thr Asp Leu 
755 . 760 765 

CAG CCG TAC ATG CGA CAG TTC GTG GCT CAC CTG CAG GAG ACC AGC CCG 2 410 

Gin Pro Tyr Met Arg Gin Phe Val Ala His Leu Gin Glu Thr Ser Pro 
770 775 780 785 

CTG AGG GAT GCC GTC GTC ATC GAG CAG AGC TCC TCC CTG AAT GAG GCC 24 58 

Leu Arg Asp Ala Val Val lie Glu Gin Ser Ser Ser Leu Asn Glu Ala 
790 795 800 

AGC AGT GGC CTC TTC GAC GTC TTC CTA CGC TTC ATG TGC CAC CAC GCC 2506 
Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His Ala 
805 810 815 

GTG CGC ATC AGG GGC AAG TCC TAC GTC CAG TGC CAG GGG ATC CCG CAG 2554 
Val Arg lie Arg Gly Lys Ser Tyr Val Gin Cys Gin Gly He Pro Gin 
820 ' 825 830 

GGC TCC ATC CTC TCC ACG CTG CTC TGC AGC CTG TGC TAC GGC GAC ATG 2602 
Gly Ser He Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met 
S35 840 845 

GAG AAC AAG CTG ( TTT GCG GGG ATT CGG CGG GAC GGG CTG CTC CTG CGT 2650 
Glu Asn Lys Leu Phe Ala Gly He Arg Arg Asp Gly Leu Leu Leu Arg 
350 855 860 865 

TTG GTG GAT GAT TTC TTG TTG GTG ACA CCT CAC CTC ACC CAC GCG AAA 2 6 98 

Leu Val Aso Asp Phe Leu Leu Val Thr Pro His Leu Thr His Ala Lys 
870 875 880 

ACC TTC CTC AGG ACC CTG GTC CGA GGT GTC CCT GAG TAT GGC TGC GTG 274 6 

Thr Phe Leu Arg Thr Leu Val Arg Gly Vai Pro Glu Tyr Gly Cys Val 
385 8 90 3 95 

GTG AAC TTG CGG AAG ACA GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC ■ 2 7 94 

Vai Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu Ala 
900 ^ 905 910 

CTG GGT GGC ACG GCT TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC 28 4 2 

Leu Gly Gly Thr Ala Phe Val Gin Met Pro Ala His Gly Leu Phe Pro 
915 ' 920 925 

TGG TGC GGC CTG CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC 28 90 

Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gin Ser Asp 
930 " 935 940 945 

TAC TCC AGC TAT GCC CGG ACC TCC ATC AGA GCC AGT CTC ACC TTC AAC 2 938 

Tyr Ser Ser Tyr Ala Arg Thr Ser He Arg Ala Ser Leu Thr Phe Asn 
950 955 960 

CGC GGC TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG GTC 2 98 6 

Arg Gly Phe Lys Ala Gly Arg Asn Met Arc Arg Lys Leu Phe Gly Vai 
965 " 970 975 

TTG CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG AAC AGC 3034 
Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gin Val Asn Ser 
980 935 990 



CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG CTG CAG GCG 3082 
Leu Gin Thr Val Cys Thr Asn He Tyr Lys He Leu Leu Leu Gin Ala 
995 1000 1005 

TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT CAT CAG CAA GTT 3130 
Tyr Arg Phe His Ala Cys Val Leu Gin Leu Pro Phe His Gin Gin Val 
1010 1015 1020 1025 

TGG AAG AAC CCC ACA TTT TTC CTG CGC GTC ATC TCT GAC ACG GCC TCC 3178 
Trp Lys Asn Pro Thr Phe Phe Leu Arg Val He Ser Asp Thr Ala Ser 
1030 1035 1040 

CTC TGC TAC TCC ATC CTG AAA GCC AAG AAC GCA GGG ATG TCG CTG GGG 322 6 

Leu Cys Tyr Ser He Leu Lys Ala Lys Asn Ala Gly Met Ser Leu Gly 
1045 1050 1055 

GCC AAG GGC GCC GCC GGC CCT CTG CCC TCC GAG GCC GTG CAG TGG CTG 3274 
Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gin Trp Leu 
1060 1065 1070 

TGC CAC CAA GCA TTC CTG CTC AAG CTG ACT CGA CAC CGT GTC ACC TAC 3322 
Cys His Gin Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr Tyr 
1075 1080 1085 

GTG CCA CTC CTG GGG TCA CTC AGG ACA GCC CAG ACG CAG CTG AGT CGG 3370 
Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gin Thr Gin Leu Ser Arg 
1090 1095 1100 1105 

AAG CTC CCG GGG ACG ACG CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG 34 18 

Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro 
1110 1115 1120 

GCA CTG CCC TCA GAC TTC AAG ACC ATC CTG GAC TGATGGCCAC CCGCCCACAG 34 71 

Ala Leu Pro Ser Asp Phe Lys Thr He Leu Asp 
1125 1130 

CCAGGCCGAG AG CAG AC ACC AGCAGCCCTG TCACGCCGGG CTCTACGTCC CAGGGAGGGA 35 31 

GGGGCGGCCC ACACCCAGGC CCGCACCGCT GGGAGTCTGA GGCCTGAGTG AGTGTTTGGC 3591 

CGAGGCCTGC ATGTCCGGCT GAAGGCTGAG TGTCCGGCTG AGGCCTGAGC GAGTGTCCAG 3651 

CCAAGGGCTG AGTGTCCAGC ACACCTGCCG TCTTCACTTC CCCACAGGCT GGCGCTCGGC 3711 

TCCACCCCAG GGCCAGCTTT TCCTCACCAG GAGCCCGGCT TCCACTCCCC ACATAGGAAT 37 71 

AGTCCATCCC CAGATTCGCC ATTGTTCACC CCTCGCCCTG CCCTCCTTTG CCTTCCACCC 38 31 

CCACCATCCA GGTGGAGACC CTGAGAAGGA CCCTGGGAGC TCTGGGAATT TGGAGTGACC 38 91 

AAAGGTGTGC CCTGTACACA GGCGAGGACC CTGCACCTGG ATGGGGGTCC CTGTGGGTCA 3951 

AATTGGGGGG AGGTGCTGTG GG AGT AAA AT ACTGAATA.TA TGAGTTTTTC AGTTTTGAAA 4011 

AAAA 4 015 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1132 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu Arg Ser 
15 10 15 

His Tyr Arg Glu Val Leu Pro Leu Ala Thr Phe Val Arg Arg Leu Gly 
20 25 30 

Pro Gin Gly Trp Arg Leu Val Gin Arg Gly Asp Pro Ala Ala Phe Arg 
35 40 45 

Ala Leu Val Ala Gin Cys Leu Val Cys Val Pro Trp Asp Ala Arg Pro 
50 55 60 

p^o Pro Ala Ala Pro Ser Phe Arg Gin Val Ser Cys Leu Lys Glu Leu 
65 70 75 80 

Val Ala Arc Val Leu Gin Arg Leu Cys Glu Arg Gly Ala Lys Asn Val 
85 90 95 

Leu Ala Phe Gly Phe Ala Leu Leu Asp Gly Ala Arg Gly Gly Pro Pro 
100 105 110 

Glu Ala Phe Thr Thr Ser Val Arg Ser Tyr Leu Pro Asn Thr Val Thr 
' ' 115 120 ' 125 

Aso Ala Leu Arg Gly Ser Gly Ala Trp Gly Leu Leu Leu Arg Arg Val 
130 135 140 

Giv As d Aso Val Leu Val His Leu Leu Ala Arg Cys Ala Leu Phe Val 
H5 ' " 150 155 160 

Leu Val Ala Pro Ser Cys Ala Tyr Gin Val Cys Gly Pro Pro Leu Tyr 
165 170 175 

Gin Leu Gly Ala Ala Thr Gin Ala Arg Pro Pro Pro His Ala Ser Gly 
180 185 190 

Pro Arg Arg Arg Leu Gly Cys Glu Arg Ala Trp Asn His Ser Val Arg 
195 200 205 

Glu Ala Gly Val Pro Leu Gly Leu Pro Ala Pro Gly Ala Arg Arg Arg 
210 " 215 220 

Gly Glv Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro Arg Arg 
225 ^ 230 235 240 

Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gin Gly Ser Trp 
245 250 255 

Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val 
260 265 270 




9 



• 



Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala 
275 280 285 

Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gin His His 
290 295 300 

Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro 
305 310 315 320 

Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly 
325 330 335 

Asp Lys Glu Gin Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro 
340 345 350 

Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr lie Phe Leu Gly Ser 
355 . 360 365 

Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gin 
370 ' 375 380 

Arg Tyr Trp Gin Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His 
385 390 395 400 

Ala Gin Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg 
405 410 415 

Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gin 
420 425 430 

Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu 
4 35 4 40 44 5 

Val Gin Leu Leu Arg Gin His Ser Ser Pro Trp Gin Val Tyr Gly Phe 
4 50 4 55 4 60 

Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser 



Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe lie Ser 
485 490 495 

Leu Gly Lys His Ala Lys Leu Ser Leu Gin Glu Leu Thr Trp Lys Met 
500 505 510 

Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys 
515 ^ 520 525 

Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu He Leu Ala Lys Phe 
530 535 540 

Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe 
545 550 555 560 

Phe Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr 



4 70 



475 



4 80 



565 



570 



575 



Arg Lys Ser Val Trp Ser Lys Leu Gin Ser He Gly He Arg Gin His 
580 585 590 
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Leu Lys Arg Val Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin 
595 600 605 

His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe lie 
610 615 620 

Pro Lys Pro Asp Gly Leu Arg Pro He Val Asn Met Asp Tyr Val Val 
625 630 635 640 

Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser 
645 650 655 

Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg 
660 665 670 

Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp He His Arg 
675 . 680 685 

Ala Tro Arg Thr Phe Val Leu Arg Val Arg Ala Gin Asp Pro Pro Pro 
690 695 700 

Glu Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr He 
705 710 715 720 

Pro Gin Asp Arg Leu Thr Glu Val He Ala Ser He He Lys Pro Gin 
725 730 735 

Asn Thr Tyr Cvs Val Arg Arg Tyr Ala Val Val Gin Lys Ala Ala His 
740 745 750 

GIv His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp 

755 760 765 

Leu Gin Pro Tyr Met Arg Gin Phe Val Ala His Leu Gin Glu Thr Ser 
770 ^ 775 780 

Pro Leu Arc Aso Ala Val Val lie Glu Gin Ser Ser Ser Leu Asn Glu 
78 5 " 7 90 7 95 8 00 

Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His 
805 ^ 810 815 

Ala Val Arg He Arg Gly Lys Ser Tyr Val Gin Cys Gin Gly He Pro 
820 825 830 

Gin Gly Ser He Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp 
835 840 845 

Met Glu Asn Lys Leu Phe Ala Gly He Arg Arg Asp Gly Leu Leu Leu 
850 855 860 

Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr His Ala 
865 870 875 880 

Lys Thr Phe Leu Arg Thr Leu Vai Arg Gly Val Pro Glu Tyr Gly Cys 
885 890 895 



Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu 
900 905 910 
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Ala Leu Gly Gly Thr Ala Phe Val Gin Met Pro Ala His Gly Leu Phe 
915 920 925 

Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gin Ser 
930 935 940 

Asd Tyr Ser Ser Tyr Ala Arg Thr Ser He Arg Ala Ser Leu Thr Phe 
945 950 955 960 

Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly 
965 970 975 

Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gin Val Asn 
980 985 990 

Ser Leu Gin Thr Val Cys Thr Asn He Tyr Lys He Leu Leu Leu Gin 
995 . 1000 1005 

Ala Tyr Arg Phe His Ala Cys Val Leu Gin Leu Pro Phe His Gin Gin 
1010 1015 - 1020 

Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val He Ser Asp Thr Ala 
1025 ' 1030 1035 1040 

Ser Leu Cys Tyr Ser He Leu Lys Ala Lys Asn Ala Gly Met Ser Leu 
1045 1050 1055 

GLv Aia Lvs Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gin Trp 
1060 1065 1070 

Leu Cys His Gin Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr 
1075 1080 1085 

Tvr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gin Thr Gin Leu Ser 
1090 1095 1100 

Arc Lvs Leu Pro Gly' Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala Asn 
1105 1110 1115 112: 

Pro Ala Leu Pro Ser Asd Phe Lys Thr He Leu Asp 
1125 * H30 



INFORMATION FOR SEQ ID NO : 3 : 

(l) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2176 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . .2176 

(D) OTHER INFORMATION: /note= "clone 712562" 
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(ix) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 23. . 802 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GGCCAAGTTC CTGCACTGGC TGATGAGTGT GTACGTCGTC GAGCTGCTCA GGTCTTTCTT 60 

TTATGTCACG GAGACCACGT TTCAAAAGAA CAGGCTCTTT TTCTACCGGA AGAGTGTCTG 120 

GAGCAAGTTG CAAAGCATTG GAATCAGACA GCACTTGAAG AGGGTGCAGC TGCGGGAGCT 180 

GTCGGAAGCA GAGGTCAGGC AGCATCGGGA AGCCAGGCCC GCCCTGCTGA CGTCCAGACT 24 0 

CCGCTTCATC CCCAAGCCTG ACGGGCTGCG GCCGATTGTG AACATGGACT ACGTCGTGGG 300 

AGCCAGAACG TTCCGCAGAG AAAAGAGGGC CGAGCGTCTC ACCTCGAGGG TGAAGGCACT 360 

GTTCAGCGTG CTCAACTACG AGCGGGCGCG GCGCCCCGGC CTCCTGGGCG CCTCTGTGCT 4 20 
GGGCCTGGAC GATATCCACA GGGCCTGGCG CACCTTCGTG CTGCGTGTGC GGGCCCAGGA 4 80 

CCCGCCGCCT GAGCTGTACT TTGTCAAGGT GGATGTGACG GGCGCGTACG ACACCATCCC 54 0 

CCAGGACAGG CTCACGGAGG TCATCGCCAG CATCATCAAA CCCCAGAACA CGTACTGCGT 600 
GCGTCGGTAT GCCGTGGTCC AGAAGGCCGC CCATGGGCAC GTCCGCAAGG CCTTCAAGAG 660 
CCACGTCCTA CGTCCAGTGC CAGGGGATCC CGCAGGGCTC CATCCTCTCC ACGCTGCTCT 720 
GCAGCCTGTG CTACGGCGAC ATGGAGAACA AGCTGTTTGC GGGGATTCGG CGGGACGGGC 7 30 

TGCTCCTGCG TTTGGTGGAT GATTTCTTGT TGGTGACACC TCACCTCACC CACGCGAAAA 840 
CC-TCCTCAG GACCCTGGTC CGAGGTGTCC CTGAGTATGG CTGCGTGGTG AACTTGCGGA 900 
AGACAGTGGT GAACTTCCCT GTAGAAGACG AGGCCCTGGG TGGCACGGCT TTTGTTCAGA 9 60 

TGCCGGCCCA CGGCCTATTC CCCTGGTGCG GCCTGCTGCT GGATACCCGG ACCCTGGAGG 1020 

T GCAGAGCGA CTACTCCAGC TATGCCCGGA CCTCCATCAG AGCCAGTCTC ACCTTCAACC 1080 

GCGGCTTCAA GGCTGGGAGG AACATGCGTC GCAAACTCTT TGGGGTCTTG CGGCTGAAGT 114 0 

GTCACAGCCT GTTTCTGGAT TTGCAGGTGA ACAGCCTCCA GACGGTGTGC ACCAACATCT 1200 

ACAAGATCCT CCTGCTGCAG GCGTACAGGT TTCACGCATG TGTGCTGCAG CTCCCATTTC 12 60 

ATCAGCAAGT TTGGAAGAAC CCCACATTTT TCCTGCGCGT CATCTCTGAC ACGGCCTCCC 1320 

TCTGCTACTC CATCCTGAAA GCCAAGAACG CAGGGATGTC GCTGGGGGCC AAGGGCGCCG 138 0 
CCGGCCNTCT GCCCTCCGAG GCCGTGCAGT GGCTGTGCCA CCAAGCATTC CTGCTCAAGC ' 14 4 0 

TGACTCGACA CCGTGTCACC TACGTGCCAC TCCTGGGGTC ACTCAGGACA GCCCAGACGC 1500 

AGCTGAGTCG GAAGCTCCCG GGGACGACGC TGACTGCCCT GGAGGCCGCA GCCAACCCGG 1560 

CACTGCCCTC AGACTTCAAG ACCATCCTGG ACTGATGGCC ACCCGCCCAC AGCCAGGCCG 162 0 
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AGAGCAGACA 


CCAGCAGCCC 


TGTCACGCCG 


GGCTCTACGT 


CCCAGGGAGG 


GAGGGGCGGC 


1680 


CCACACCCAG 


GCCTGCACCG 


CTGGGAGTCT 


GAGGCCTGAG 


TGAGTGTTTG 


GCCGAGGCCT 


1740 


GCATGTCCGG 


CTGAAGGCTG 


AGTGTCCGGC 


TGAGGCCTGA 


GCGAGTGTCC 


AGCCAAGGGC 


1800 


TGAGTGTCCA 


GCACACCTGC 


CGTCTTCACT 


TCCCCACAGG 


CTGGCGCTCG 


GCTCCACCCC 


1860 


AGGGCCAGCT 


TTTCCTCACC 


AGGAGCCCGG 


CTTCCACTCC 


CCACATAGGA 


ATAGTCCATC 


1920 


CCCAGATTCG 


CCATTGTTCA 


CCCCTCGCCC 


TGCCCTCCTT 


TGCCTTCCAC 


CCCCACCATC 


1980 


CAGGTGGAGA 


CCCTGAGAAG 


GACCCTGGGA 


GCTCTGGGAA 


TTTGGAGTGA 


CCAAAGGTGT 


2040 








at fine; fin t 


CCCTGTGGGT 


CAAATTGGGG 


2100 


GGAGGTGCTG 


TGGGAGTAAA 


ATACTGAATA 


TATGAGTTTT 


TCAGTTTTGN 


AAAAAAAAAA 


2160 


AAAAAAAAAA 


AAAAAA 










2176 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3855 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(3) LOCATION: 1 . . 3355 

(D) OTHER INFORMATION: /noie= "nucleic acid sequence with an 

ooer. reading frame encoding a celta-182 
variant polypeptide" 

(i:-:) FEATURE: 

(A) NAME/ KEY : CDS 

(3) LOCATION: 56 . . 2479 

(D) OTHER INFORMATION: /product^ "delta-182 variant 

polypeptide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC CCGCG AT' 

Me' 



CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC CAC 106 

Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu Arg Ser His 

5 10 15 

TAG CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG GGG CCC 154 

Tvr Arg Giu Val Leu Pro Leu Ala Thr ?he Val Arg Arg Leu Gly Pro 

20 25. 30 
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CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT TTC CGC GCG 202 
Gin Gly Trp Arg Leu Val Gin Arg Gly Asp Pro Ala Ala Phe Arg Ala 
35 40 45 

CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC GCA CGG CCG CCC 2 50 

Leu Val Ala Gin Cys Leu Val Cys Val Pro Trp Asp Ala Arg Pro Pro 
50 55 60 65 

CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC CTG AAG GAG CTG GTG 2 98 

Pro Ala Ala Pro Ser Phe Arg Gin Val Ser Cys Leu Lys Glu Leu Val 
70 75 80 

GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC GGC GCG AAG AAC GTG CTG 34 6 

Ala Arg Val Leu Gin Arg Leu Cys Glu Arg Gly Ala Lys Asn Val Leu 
85 90 95 

GCC TTC GGC TTC GCG CTG CTG GAC GGG GCC CGC GGG GGC CCC CCC GAG 394 
Ala Phe Gly Phe Ala Leu Leu Asp Gly Ala Arg Gly Gly Pro Pro Glu 
100 105 110 

GCC TTC ACC ACC AGC GTG CGC AGC TAC CTG CCC AAC ACG GTG ACC GAC 4 42 

Ala Phe Thr Thr Ser Val Arg Ser Tyr Leu Pro Asn Thr Val Thr Asp 
115 120 125 

GCA CTG CGG GGG AGC GGG GCG TGG GGG CTG CTG CTG CGC CGC GTG GGC 4 90 

Ala Leu Arg Gly Ser Gly Ala Trp Gly Leu Leu Leu Arg Arg Val Gly 
130 " ' 135 140 145 

GAC GAC GTG CTG GTT CAC CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG 538 
Aso Asd Val Leu Val His Leu Leu Ala Arg Cys Ala Leu Phe Val Leu 
150 155 160 

GTG GCT CCC AGC TGC GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG 58 6 

Vai. Ala Pro Ser Cys Ala Tyr Gin Val Cys Gly Pro Pro Leu Tyr Gin 
155 170 175 

CTC GGC GCT GCC ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC 634 
Leu Gly Ala Ala Thr Gin Ala Arg Pro Pro Pro His Ala Ser Gly Pro 
180 185 190 

CGA AGG CGT CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG 68 2 

Arg Arg Arg Leu Gly Cys Glu Arg Ala Trp Asn His Ser Val Arg Glu 
195 " 200 205 

GCC GGG GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG 7 30 

Ala Gly Val Pro Leu Gly Leu Pro Ala Pro Gly Ala Arg Arg Arg Gly 
210 215 220 225 

GGC AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC 778 
Glv Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro Arg Arg Gly 
230 235 240 

GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG GCC 826 
Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gin Gly Ser Trp Ala 
245 250 255 

CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT GTG GTG 87 4 

His Fro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val Val 
260 265 270 
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TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG GGT GCG CTC 922 
Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala Leu 
275 . 280 285 

TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC CAG CAC CAC GCG 97 0 

Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gin His His Ala 
290 295 300 305 

GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC TGG GAC ACG CCT TGT 1018 
Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro Cys 
310 315 320 

CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC CTC TAC TCC TCA GGC GAC 10 66 

Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly Asp 
325 330 335 

AAG GAG CAG CTG CGG CCC TCC TTC CTA CTC AGC TCT CTG AGG CCC AGC 1114 
Lys Glu Gin Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro Ser 
340 345 350 

CTG ACT GGC GCT CGG AGG CTC GTG GAG ACC ATC TTT CTG GGT TCC AGG 1162 
Leu Thr Gly Ala Arg Arg Leu Val Glu Thr He Phe Leu Gly Ser Arg 
355 360 365 

CCC TGG ATG CCA GGG ACT CCC CGC AGG TTG CCC CGC CTG CCC CAG CGC 1210 
Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gin Arg 
370 * 375 380 385 

TAC TGG CAA ATG CGG CCC CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG . 1258 

Tyr Trp Gin Met Arg Pro Leu, Phe Leu Glu Leu Leu Gly Asn His Ala 
390 395 400 

CAG TGC CCC TAC GGG GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT 130 6 

Gin Cvs Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg Ala 
4 0 5 4 10 4 15 

GCG GTC ACC CCA GCA GCC GGT GTC TGT GCC CGG GAG AAG CCC CAG GGC 13 54 

Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gin Gly 
420 425 430 

TCT GTG GCG GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC CTG GTG 14 02 

Ser val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu Val 
4 3 5 4 4 0 4 4 5 

CAG CTG CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG 14 50 

Gin Leu Leu Arg Gin His Ser Ser Pro Trp Gin Val Tyr Gly Phe Val 
450 455 460 465 

CGG GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 14 98 

Arg Ala Cvs Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser Arg 
470 475 430 

CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC CTG 154 6 

His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe He Ser Leu 
485 490 495 

GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG ATG AGC 1594 
Giy Lys His Ala Lys Leu Ser Leu Gin Glu Leu Thr Trp Lys Met Ser 
500 505 510 
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GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT GGC TGT GTT 164 2 

Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys Val 
515 520 525 

CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG GCC AAG TTC CTG 1690 
Pro Ala Ala Glu His Arg Leu Arg Glu Glu lie Leu Ala Lys Phe Leu 
530 535 540 545 

CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG CTC AGG TCT TTC TTT 17 38 

His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe 
550 555 560 

TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG CTC TTT TTC TAC CGG 1786 
Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr Arg 
565 570 575 

AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT GGA ATC AGA CAG CAC TTG 183 4 

Lys Ser Val Trp Ser Lys Leu Gin Ser lie Gly He Arg Gin His Leu 
580 585 590 

AAG AGG GTG CAG CTG CGG GAG CTG TCG GAA GCA GAG GTC AGG CAG CAT 188 2 

Lys Arg Val Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin His 
595 600 605 

CGG GAA GCC AGG CCC GCC CTG CTG ACG TCC AGA CTC CGC TTC ATC CCC 1930 
Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe He Pro 
610 615 620 625 

AAG CCT GAC GGG CTG CGG CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA 197 8 

Lys Pro Asp Gly Leu Arg Pro He Val Asn Met Asp Tyr Val Val Gly 
630 635 640 

GCC AGA ACG TTC CGC AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG 202 6 

Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg 
645 650 655 

GTC AAG GCA CTG TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC 20 7 4 

Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro 
660 665 670 

GGC CTC CTG GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC 2122 
Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp lie His Arg Ala 
675 680 685 

TGG CGC ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG 2170 
Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gin Asp Pro Pro Pro Glu 
690 695 700 705 

CTG TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 2218 
Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr He Pro 
710 715 720 

CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 22 66 

Gin Asp Arg Leu Thr Glu Val He Ala Ser He He Lys Pro Gin Asn 
725 730 735 

ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC CAT GGG 2314 
Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gin" Lys Ala Ala His Gly 
740 745 750 




CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC CTA CGT CCA GTG CCA GGG 2362 
His Val Arg Lys Ala Phe Lys Ser His Val Leu Arg Pro Val Pro Gly 
755 760 765 

GAT CCC GCA GGG CTC CAT CCT CTC CAC GCT GCT CTG CAG CCT GTG CTA 2410 
Asp Pro Ala Gly Leu His Pro Leu His Ala Ala Leu Gin Pro Val Leu 
770 775 780 785 

CGG CGA CAT GGA GAA CAA GCT GTT TGC GGG GAT TCG GCG GGA CGG GCT 24 58 
Arg Arg His Gly Glu Gin Ala Val Cys Gly Asp Ser Ala Gly Arg Ala 
790 795 800 

GCT CCT GCG TTT GGT GGA TGATTTCTTG TTGGTGACAC CTCACCTCAC 250 6 
Ala Pro Ala Phe Gly Gly 
805 

CCACGCGAAA ACCTTCCTCA GGACCCTGGT CCGAGGTGTC CCTGAGTATG GCTGCGTGGT 2566 

GAACTTGCGG AAGACAGTGG TGAACTTCCC TGTAGAAGAC GAGGCCCTGG GTGGCACGGC 2 62 6 

TTTTGTTCAG ATGCCGGCCC ACGGCCTATT CCCCTGGTGC GGCCTGCTGC TGGATACCCG 2 68 6 

GACCCTGGAG GTGCAGAGCG ACTACTCCAG CTATGCCCGG ACCTCCATCA GAGCCAGTCT 27 4 6 

CACCTTCAAC CGCGGCTTCA AGGCTGGGAG GAACATGCGT CGCAAACTCT TTGGGGTCTT 2806 

GCGGCTGAAG TGTCACAGCC TGTTTCTGGA TTTGCAGGTG AACAGCCTCC AGACGGTGTG 2866 

CACCAACATC 7 AC AAG AT CC TCCTGCTGCA GGCGTACAGG TTTCACGCAT GTGTGCTGCA 2 92 6 

GCTCCCATTT CATCAGCAAG TTTGGAAGAA CCCCACATTT TTCCTGCGCG TCATCTCTGA 2 98 6 

CACGGCCTCC CTCTGCTACT CCATCCTGAA AGCCAAGAAC GCAGGGATGT CGCTGGGGGC 304 6 

CAAGCGCGCC GCCGGCCCTC TGCCCTCCGA GGCCGTGCAG TGGCTGTGCC ACCAAGCATT 310 6 

CCTGCTCAAG CTG AC TCG AC ACCGTGTCAC CTACGTGCCA CTCCTGGGGT CACTCAGGAC 3166 

AGCCCAGACG CAGCTGAGTC GGAAGCTCCC GGGGACGACG CTGACTGCCC TGGAGGCCGC 322 6 

AGCCAACCCG GCACTGCCCT CAGACTTCAA GACCATCCTG GACTGATGGC CACCCGCCCA 328 6 

CAGCCAGGCC GAG AGC AG AC ACCAGCAGCC CTGTCACGCC GGGCTCTACG TCCCAGGGAG 3 34 6 

GGAGGGGCGG CCCACACCCA GGCCCGCACC GCTGGGAGTC TGAGGCCTGA GTGAGTGTTT 34 0 6 

GGCCGAGGCC TGCATGTCCG GCTGAAGGCT GAGTGTCCGG CTGAGGCCTG AGCGAGTGTC 34 66 

CAGCCAAGGG CTGAGTGTCC AGCACACCTG CCGTCTTCAC TTCCCCACAG GCTGGCGCTC 352 6 

GGCTCCACCC CAGGGCCAGC TTTTCCTCAC CAGGAGCCCG GCTTCCACTC CCCACATAGG 3586 

AATAGTCCAT CCCCAGATTC GCCATTGTTC ACCCCTCGCC CTGCCCTCCT TTGCCTTCCA 364 6 

CCCCCACCAT CCAGGTGGAG ACCCTGAGAA GGACCCTGGG AGCTCTGGGA ATTTGGAGTG 370 6 

ACCAAAGGTG TGCCCTGTAC ACAGGCGAGG ACCCTGCACC TGGATGGGGG TCCCTGTGGG 37 66 

TCAAATTGGG GGGAGGTGCT GTGGGAGTAA AATACTGAAT ATATGAGTTT TTCAGTTTTG 382 6 

AAAAJVAAAAA AAAAAAAAAA AAAAAAAAA 3855 
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(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu Arg Ser 
15 10 15 

His Tyr Arg Glu Val Leu Pro Leu Ala Thr Phe Val Arg Arg Leu Gly 
20 25 30 

Pro Gin Gly Trp Arg Leu Val Gin Arg Gly Asp Pro Ala Ala Phe Arg 
35 40 45 

Ala Leu Val Ala Gin Cys Leu Val Cys Val Pro Trp Asp Ala Arg Pro 
50 55 60 

Pro Pro Ala Ala Pro Ser Phe Arg Gin Val Ser Cys Leu Lys Glu Leu 
65 70 75 80 

Val Ala Arc Val Leu Gin Arg Leu Cys Glu Arg Gly Ala Lys Asn Val 
85 90 95 

Leu Ala Phe Gly Phe Ala Leu Leu' Asp Gly Ala Arg Gly Gly Pro Pro 
100 105 110 

Glu Ala Phe Thr Thr Ser Val Arg Ser Tyr Leu Pro Asn Thr Val Thr 
115 120 * 125 

Aso A L a Leu Arg Gly Ser Gly Ala Trp Gly Leu Leu Leu Arg Arg Val 
130 135 140 

Giy Asp Aso Val Leu Val His Leu Leu Ala Arg Cys Ala Leu Phe Val 
145 ' 150 155 160 

Leu Val Ala Pro Ser Cys Ala Tvr Gin Val Cys Gly Pro Pro Leu Tyr 
165 ' 170 175 

Gin Leu Gly Ala Ala Thr Gin Ala Arg Pro Pro Pro His Ala Ser Gly 
180 185 190 

Pro Arg Arg Arg Leu Gly Cys Glu Arg Ala Trp Asn His Ser Val Arg 
195 200 205 

Glu Ala Glv Val Pro Leu Gly Leu Pro Ala Pro Gly Ala Arg Arg Arg 
210 " 215 220 

Glv Giy Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro Arg Arg 
225 230 235 240 

Glv Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gin Gly Ser Trp 
245 250 255 

Ala His Pro Giy Arg Thr Arg Giy Pro Ser Asp Arg Gly Phe Cys Val 
260 265 270 
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Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala 
275 280 285 

Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gin His. His 
290 295 300 

Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro 
305 310 315 320 

Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly 
325 330 335 

Asp Lys Glu Gin Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro 
340 345 350 

Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr He Phe Leu Gly Ser 
355 - 360 365 

Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gin 
370 ' 375 380 

Ara Tyr Tro Gin Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His 
385 ' 390 395 400 

Ala Gin Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg 
405 410 415 

A La Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gin 
420 425 430 

Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu 
4 3 5 4 4 0 4 4 5 

val Gin Leu Leu Arg Gin His Ser Ser Pro Trp Gin Val Tyr Gly Phe 
4 50 4 55 4 60 

Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser 
4 65 470 475 430 

Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe lie Ser 
485 490 495 

Leu Gly Lys His Ala Lys Leu Ser Leu Gin Glu Leu Thr Trp Lys Met 
500 505 510 

Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys 
• 515 520 525 

Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu He Leu Ala Lys Phe 
530 535 540 

Leu His Tro Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe 
545 " 550 555 560 

Phe Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr 
565 570 575 



Arg Lys Ser Val Trp Ser Lys Leu Gin Ser He Gly He Arg Gin His 
580 * 585 590 
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Leu Lys Arg Vai Gin Leu Arg Glu Leu Ser Glu Ala Glu Vai Arg Gin 
595 600 605 

His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe lie 
610 615 620 

Pro Lys Pro Asp Gly Leu Arg Pro He Vai Asn Met Asp Tyr Vai Vai 
625 630 635 640 

Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser 
645 650 655 

Ara Vai Lys Ala Leu Phe Ser Vai Leu Asn Tyr Glu Arg Ala Arg Arg 
660 665 670 

Pro Gly Leu Leu Gly Ala Ser Vai Leu Gly Leu Asp Asp He His Arg 
675 . 680 685 

Ala Trp Arg Thr Phe Vai Leu Arg Vai Arg Ala Gin Asp Pro Pro Pro 
690 695 700 

G'u Leu Tyr Phe Vai Lys Vai Asp Vai Thr Gly Ala Tyr Asp Thr He 
705 710 715 720 

?-o Gin Aso Arg Leu Thr Glu Vai lie Ala Ser He He Lys Pro Gin 
725 730 735 

Asp. Thr Tyr Cys Vai Arg Arg Tyr Ala Vai Vai Gin Lys Ala Ala His 
740 745 750 

Gly His Vai Arg Lys Ala Phe Lys Ser His Vai Leu Arg Pro Vai Pro 
755 760 765 

Glv Aso Pro Ala Gly Leu His Pro Leu His Ala Ala Leu Gin Pro Vai 

770 775 730 

a^g Arg His Gly Glu Gin Ala Vai Cvs Gly Asp Ser Ala Gly Arg 

735 790 795 800 



Ala Ala Pro Ala Phe Gly Gly 
805 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4335 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(n) MOLECULE TYPE : DMA (genomic) 



(i;<) FEATURE: 

(A) NAME /KEY: - 

(B) LOCATION: i . .4 335 

(D) OTHER INFORMATION: /note= "genomic DNA insert of pGRN144 M 



(ix) FEATURE: 

(A) NAME /KEY : intron 

(B) LOCATION: 2715.. 2818 

(D) OTHER INFORMATION: /note- "intron 1" 

(ix) FEATURE: 

(A) NAME /KEY : intron 

(B) LOCATION: 4173.. 4326 

(D) OTHER INFORMATION: /note= "intron 2" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CCATGGGACC CACTGCAGGG GCAGCTGGGA GGCTGCAGGC TTCAGGTCCC AGTGGGGTTG 60 

CCATCTGCCA GTAGAAACCT GATGTAGAAT CAGGGCGCGA GTGTGGACAC TGTCCTGAAT 120 

CTCAATGTCT CAGTGTGTGC TGAAACATGT AGAAATTAAA GTCCATCCCT CCTACTCTAC 180 

TGGGATTGAG CCCCTTCCCT ATCCCCCCCC AGGGGCAGAG GAGTTCCTCT CACTCCTGTG 24 0 

GAGGAAGGAA TGATACTTTG TTATTTTTCA CTGCTGGTAC TGAATCCACT GTTTCATTTG 300 

TTGGTTTGTT TGTTTTGTTT TGAGAGGCGG TTTCACTCTT GTTGCTCAGG CTGGAGGGAG 360 

TGCAATGGCG CGATCTTGGC TTACTGCAGC CTCTGCCTCC CAGGTTCAAG TGATTCTCCT 4 20 

GCTTCCGCCT CCCATTTGGC TGGGATTACA GGCACCCGCC ACCATGCCCA GCTAATTTTT 480 

TGTATTTTTA G T AG AG AC GG GGGTGGGGGT GGGGTTCACC ATGTTGGCCA GGCTGGTCTC 540 

GAACTTCTGA CCTCAGATGA TCCACCTGCC TCTGCCTCCT AAAGTGCTGG GATTACAGGT 600 

GTGAGCCACC ATGCCCAGCT CAGAATTTAC TCTGTTTAGA AACATCTGGG TCTGAGGTAG 660 

GAAGCTCACC CCACTCAAGT GTTGTGGTGT TTTAAGCCAA TGATAGAATT TTTTTATTGT 720 

TGTTAGAACA CTCTTGATGT TTTACACTGT GATGACTAAG AC AT CATC AG CTTTTCAAAG 780 

ACACACTAAC TGCACCCATA ATACTGGGGT GTCTTCTGGG TATCAGCGAT CTTCATTGAA 840 

TGCCGGGAGG CGTTTCCTCG CCATGCACAT GGTGTTAATT ACTCCAGCAT AATCTTCTGC 900 

TTCCATTTCT TCTCTTCCCT' CTTTTAAAAT TGTGTTTTCT ATGTTGGCTT CTCTGCAGAG 960 

AACCAGTGTA AGCTACAACT TAACTTTTGT TGGAACAAAT TTTCCAAACC GCCCCTTTGC 1020 

CCTAGTGGCA GAGACAATTC ACAAACACAG CCCTTTAAAA AGGCTTAGGG ATCACTAAGG 108 0 

GG AT TTC TAG AAGAGCGACC CGTAATCCTT AAGTATTTAC AAGACGAGGC TAACCTCCAG 114 0 

CGAGCGTGAC AGCCCAGGGA GGGTGCGAGG CCTGTTCAAA TGCTAAGCTT CCATAAATAA 1200 

AGCAAATTTC CTCCGGCAGT TTCTGGAAAG TAGGAAAGGT TAACATTTAA GGTTGCGTTT 12 60 

GTTAGCATTT CAGTGTTTGC CGACCTCAGC TAACAGCATC CCTGCAAGGC CTCGGGAGAC 1320 

CCAGAAGTTT CTCGCCCCTT AGATCCAAAC TTGAGCAACC CGGAGTCTGG ATTCCTGGGA 138 0 

AGTCCTCAGC TGTCCTGCGG TTGTGCCGGG GCCCCAGGTC TGGAGGGGAC CAGTGGCCGT 14 40 




GTGGCTTCTA 


CTGCTGGGCT 


GGAAGTCGGG 


CCTCCTAGCT 


CTGCAGTCCG 


AGGCTTGGAG 


1500 


CCAGGTGCCT 


GGACCCCGAG 


GCTGCCCTCC 


ACCCTGTGCG 


GGCGGGATGT 


GACCAGATGT 


1560 


TGGCCTCATC 


TGCCAGACAG 


AGTGCCGGGG 


CCCAGGGTCA 


AGGCCGTTGT 


GGCTGGTGTG 


1620 


AGGCGCCCGG 


TGCGCGGCCA 


GCAGGAGCGC 


CTGGCTCCAT 


TTCCCACCCT 


TTCTCGACGG 


1680 


GACCGCCCCG 


GTGGGTGATT 


AACAGATATT 


GGGGTGGTTT 


GCTCATGGTG 


GGGACCCCTT 


1740 


CGCCGCCTGA 


GAACCTGCAA 


AGAGAAATGA 


CGGGCCTGTG 


TCAAGGAGCC 


CAAGTCGCGG 


1800 


GGAAGTGTTG 


CAGGGAGGCA 


CTCCGGGAGG 


TCCCGCGTGC 


CCGTCCAGGG 


AGCAATGCGT 


1860 


CCTCGGGTTC 


GTCCCCAGCC 


GCGTCTACGC 


GCCTCCGTCC 


TCCCCTTCAC 


GTCCGGCATT 


1920 


CGTGGTGCCC 


GGAGCCCGAC 


GCCCCGCGTC 


CGGACCTGGA 


GGCAGCCCTG 


GGTCTCGGGA 


1980 


TCAGGCCAGC 


GGCCAAAGGG 


TCGCCGCACG 


CACCTGTTCC 


CAGGGCCTCC 


ACATCATGGC 


2040 


CCCTCCCTCG 


GGTTACCCCA 


CAGCCTAGGC 


CGATTCGACC 


TCTCTCCGCT 


GGGGCCCTCG 


2100 


CTGGCGTCCC 


TGCACCCTGG 


GAGCGCGAGC 


GGCGCGCGGG 


CGGGGAAGCG 


CGGCCCAGAC 


2160 


CCCCGGGTCC 


GCCCGGAGCA 


GCTGCGCTGT 


CGGGGCCAGG 


CCGGGCTCCC 


AGTGGATTCG 


2220 


CGGGCAACAG 


ACGCCCAGGA 


CCGCGCTTCC 


CACGTGGCGG 


AGGGACTGGG 


GACCCGGGCA 


2280 


CCGGTCCTGC 


CCCTTCACCT 


TCCAGCTCCG 


CCTCGTCCGC 


GCGGAACCCC 


GCCCCGTCCC 


2340 


GAACCCTTCC 


CGGGTCCCCG 


GCCCAGCCCC 


TTCCGGGCCA 


TCCCAGCCCG 


TCCCGTTCCT 


■ 2400 


TTTCCGCGGC 


CCCGCCCTCT 


CCTCGCGGCG 


CGAGTTTCAG 


GCAGCGCTGC 


GTCCTGCTGC 


2460 


GCACGTGGGA 


AGCCCTGGCC 


CCGGCCACCC 


CCGCGATGCC 


GCGCGCTCCC 


CGCTGCCGAG 


2520 


CCGTGCGCTC 


CCTGCTGCGC 


AGCCACTACC 


GCGAGGTGCT 


GCCGCTGGCC 


ACGTTCGTGC 


2580 


GGCGCCTGGG 


GCCCCAGGGC 


TGGCGGCTGG 


TGCAGCGCGG 


GGACCCGGCG 


GCTTTCCGCG 


2640 


CGCTGGTGGC 


CCAGTGCCTG 


GTGTGCGTGC 


CCTGGGACGC 


ACGGCCGCCC 


CCCGCCGCCC 


2700 


CCTCCTTCCG 


CCAGGTGGGC 


CTCCCCGGGG 


TCGGCGTCCG 


GCTGGGGTTG 


AGGGCGGCCG 


2760 


GGGGGAACCA 


GCGACATGCG 


GAGAGCAGCG 


CAGGCGACTC 


AGGGCGCTTC 


CCCCGCAGGT 


2820 


GTCCTGCCTG 


AAGGAGCTGG 


TGGCCCGAGT 


GCTGCAGAGG 


CTGTGCGAGC 


GCGGCGCG.AA 


2880 


GAACGTGCTG 


GCCTTCGGCT 


TCGCGCTGCT 


GGACGGGGCC 


CGCGGGGGCC 


CCCCCGAGGC 


2940 


CTTCACCACC 


AGCGTGCGCA 


GCTACCTGCC 


CAACACGGTG 


ACCGACGCAC 


TGCGGGGGAG 


3000 


CGGGGCGTGG 


GGGCTGCTGC 


TGCGCCGCGT 


GGGCGACGAC 


GTGCTGGTTC 


ACCTGCTGGC . 


3060 


ACGCTGCGCG 


CTCTTTGTGC 


TGGTGGCTCC 


CAGCTGCGCC 


TACCAGGTGT 


GCGGGCCGCC 


3120 




CTCGG^GCTG 


CCACTCAGGC 


CCGGCCCCCG 


CCACACGCTA 


GTGGACCCCG 


3180 


AAGGCGTCTG 


GGATGCGAAC 


GGGCCTGGAA 


CCATAGCGTC 


AGGGAGGCCG 


GGGTCCCCCT 


3240 


GGGCCTGCCA 


GCCCCGGGTG 


CGAGGAGGCG 


CGGGGGCAGT 


GCCAGCCGAA 


GTCTGCCGTT 


3300 



GCCCAAGAGG CCCAGGCGTG GCGCTGCCCC TGAGCCGGAG CGGACGCCCG TTGGGCAGGG 3360 

GTCCTGGGCC CACCCGGGCA GGACGCGTGG ACCGAGTGAC CGTGGTTTCT GTGTGGTGTC 34 20 

ACCTGCCAGA CCCGCCGAAG AAGCCACCTC TTTGGAGGGT GCGCTCTCTG GCACGCGCCA 3480 

CTCCCACCCA TCCGTGGGCC GCCAGCACCA CGCGGGCCCC CCATCCACAT CGCGGCCACC 3540 

ACGTCCCTGG GACACGCCTT GTCCCCCGGT GTACGCCGAG ACCAAGCACT TCCTCTACTC 3600 

CTCAGGCGAC AAGGAGCAGC TGCGGCCCTC CTTCCTACTC AGCTCTCTGA GGCCCAGCCT 3660 

GACTGGCGCT CGGAGGCTCG TGGAGACCAT CTTTCTGGGT TCCAGGCCCT GGATGCCAGG 3720 

GACTCCCCGC AGGTTGCCCC GCCTGCCCCA GCGCTACTGG CAAATGCGGC CCCTGTTTCT 37 80 

GGAGCTGCTT GGGAACCACG CGCAGTGCCC CTACGGGGTG CTCCTCAAGA CGCACTGCCC 38 4 0 

GCTGCGAGCT GCGGTCACCC CAGCAGCCGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 3900 

TGTGGCGGCC CCCGAGGAGG AGGACACAGA CCCCCGTCGC CTGGTGCAGC TGCTCCGCCA 3960 

GCACAGCAGC CCCTGGCAGG TGTACGGCTT CGTGCGGGCC TGCCTGCGCC GGCTGGTGCC 4020 

CCCAGGCCTC TGGGGCTCCA GGCACAACGA ACGCCGCTTC CTCAGGAACA CCAAGAAGTT 4 080 

CATCTCCCTG GGGAAGCATG CCAAGCTCTC GCTGCAGGAG CTGACGTGGA AGATGAGCGT 414 0 

GCGGGACTGC GCTTGGCTGC GCAGGAGCCC AGGTGAGGAG GTGGTGGCCG TCGAGGGCCC 4200 

AGGCCCCAGA GCTGAATGCA GTAGGGGCTC AGAAAAGGGG GCAGGCAGAG CCCTGGTCCT 4 2 60 

CCTGTCTCCA TCGTCACGTG GGCACACGTG GCTTTTCGCT CAGGACGTCG AGTGGACACG 4 320 

GTGATCGAGG TCGAC 4335 

[2) INFORMATION FOR SEQ ID MO : 7 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS. : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 

(ix) FEATURE : 

(A)' NAME/KEY: intron 
(3) LOCATION: 95 . . 198 

(D) OTHER INFORMATION: /note= "intron 1" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

GGACCCGGCG GCTTTCCGCG CGCTGGTGGC CCAGTGCCTG GTGTGCGTGC CCTGGGACGC 60 

ACGGCCGCCC CCCGCCGCCC CCTCCTTCCG CCAGGTGGGC CTCCCCGGGG TCGGCGTCCG 120 

GCTGGGGTTG AGGGCGGCCG GGGGGA^CCA GCGACATGCG GAGAGCAGCG CAGGCGACTC 18 C 




AGGGCGCTTC CCCCGCAGGT GTCCTGCCTG AAGGAGCTGG TGGCCCGAGT GCTGCAGAGG 24 0 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1. .389 

(D) OTHER INFORMATION: /note= "expressed sequence tag (EST) 

AA281296" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GCCAAGTTCC TGCACTGGCT GATGAGTGTG TACGTCGTCG AGCTGCTCAG GTCTTTCTTT 60 
TA7GTCACGG AGACCACGTT TCAAAAGAAC AGGCTCTTTT TCTACCGGAA GAGTGTCTGG 120 
AGCAAGTTGC AAAGCATTGG AATCAGACAG CACTTGAAGA GGGTGCAGCT GCGGGACGTG 180 
TCGGAAGCAG AGGTCAGGCA GCATCGGGAA GCCAGGCCCG CCCTGCTGAC GTCCAGACTC 240 
CGCTTCATCC CCAAGCCTGA CGGGCTGCGG CCGATTGTGA AC AT GG ACT A CGTCGTGGGA 300 
GCCAGAACGT TCCGCAGAGA AAAGAGGGCC GAGCGTCTCA CCTCGAGGGT GAAGGCACTG 360 
TTCAGCGTGC TCAACTACGA GCGGGCGCG 389 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 - - 182 

(D) OTHER INFORMATION: /note= "182 basepair sequence deleted in 

clone 712562" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
TCTACCTTGA CAGACCTCCA GCCGTACATG CGACAGTTCG TGGCTCACCT GCAGGAGACC 60 



AGCCCGCTGA GGGATGCCGT CGTCATCGAG CAGAGCTCCT CCCTGAATGA GGCCAGCAGT 120 



25 



GGCCTCTTCG ACGTCTTCCT ACGCTTCATG TGCCACCACG CCGTGCGCAT CAGGGGCAAG 180 
TC 182 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

( ii ) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME /KEY : Protein 

(B) LOCATION: 1 . . 259 

(D) OTHER INFORMATION: /note- "protein encoded by clone 712562" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

M^t Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr 

15 10 15 

Glu Th*- Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val 
20 25 30 

Trp Ser Lys Leu Gin Ser lie Gly lie Arg Gin His Leu Lys Arg Val 
3 5 4 0 4 5 

Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin His Arg Glu Ala 
50 55 60 

Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe lie Pro Lys Pro Asp 

65 70 75 80 

Gly Leu Arg Pro He Val Asn Met Asp Tyr Val Val Gly Ala Arg Thr 
85 90 95 

Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg Val Lys Ala 
100 105 110 



Leu Phe Ser Val Leu Asn Tyr Glu 
115 . 120 

Gly Ala Ser Val Leu Gly Leu Asp 
130 135 

Phe Val Leu Arg Val Arg Ala Gin 
145 150 

Val Lys Val Asp Val Thr Gly Ala 
165 



Arg Ala Arg Arg Pro Gly Leu Leu 
125 

Asp He His Arg Ala Trp Arg Thr 
140 

Aso Pro Pro Pro Giu Leu Tyr Phe 

155 160 

Tyr Asp Thr He Pro Gin Asp Arg 
170 175 



Leu Thr- Glu Val He Ala Ser He He Lys Pro Gin Asn Thr Tyr Cys 
180 185 190 



26 



Val Arg Arg Tyr Ala Val Val Gin Lys Ala Ala His Gly His Val Arg 
195 200 205 

Lys Ala Phe Lys Ser His Val Leu Arg Pro Val Pro Gly Asp Pro Ala 
210 215 220 

Gly Leu His Pro Leu His Ala Ala Leu Gin Pro Val Leu Arg Arg His 
225 230 235 240 

Gly Glu Gin Ala Val Cys Gly Asp Ser Ala Gly Arg Ala Ala Pro Ala 
245 250 255 

Phe Gly Gly 



(2) INFORMATION FOR SEQ ID NO : 1 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-s ite 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /product= "OTHER" 

/note- "Xaa = Leu or lie" 



(L:-:) FEATURE: 

(A) NAME/ KEY : Modi f ied-s ite 

(B) LOCATION: 10 

{ D ) OTHER INFORMATION: /product- "OTHER" 

/note- "Xaa = Leu or lie" 



( A ) NAM E/KEY : Modified 

{5} LOCATION: 11 

(D) OTHER INFORMATION: 



site 

/product- "OTHER" 

/note- "Xaa = Leu or lie" 



(ix) FEATURE: 

(A) NAME /KEY: Modified 

(B) LOCATION: 12 

(D) OTHER INFORMATION: 



site 

/product- "OTHER" 

/note- "Xaa - Gin or Arg" 



(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-s ite 

(B) LOCATION: 28 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note- "Xaa = Phe or Tyr" 



(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-si te 

(B) LOCATION: 29 

(D) OTHER INFORMATION: /product- "OTHER" 

/note- "Xaa = Phe or Tyr" 



27 



(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-si te 

(B) LOCATION: 31 

(D) OTHER INFORMATION: /product- "OTHER" 

/note- "Xaa - Lys or His" 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Phe Trp 
15 10 15 

Xaa Thr Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Arg Xaa Xaa 
20 25 30 

Xaa Trp 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

CD) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

{ A) NAME /KEY : Modi f ied-si te 
(3) LOCATION: 2 

{ D ) OTHER INFORMATION: /product= "OTHER" 

/note= "Xaa = Leu or lie" 

(i:-:) FEATURE: 

(A) NAME /KEY": Mod i f ied-s i te 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /product- "OTHER" 

/note= "Xaa = Leu or He" 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-s i te 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /product- "OTHER" 

/note- "Xaa = Leu or lie" 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-s ite 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa - Gin or Arg" 

Cix) FEATURE : 

{ A) NAME /KEY : Modi f ied-si te 
(B) LOCATION: 2 9 

(D) OTHER INFORMATION: /product- "OTHER" 

/note- "Xaa - Phe or Tyr" 



28 



(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-s ite 

(B) LOCATION: 30 

(D) OTHER INFORMATION: /product= "OTHER" 

/note= "Xaa = Phe or Tyr* 

(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-s ite 

(B) LOCATION: 32 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa = Lys or His* 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Phe Trp 
1-5 10 15 

Xaa Thr Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Arg Xaa 
20 25 30 

Xaa Xaa Trp 
35 



INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



; ix) FEATURE : 

( A } NAM E / K E Y : Pe p t i de 
(B) LOCATION: 1 . - 129 

(D) OTHER INFORMATION: /note- "TRT motifs from human' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 3 : 

Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu 
15 10 15 



Arg Ser Phe Phe 
20 

Phe Phe Tyr Arg 
35 

Arg Gin His Leu 

50 

Val Arg Gin His 
65 

Arg Phe lie Pro 



Tyr Val Thr Glu 



Lys Ser Val Trp 
4 0 

Lys Arg Val Gin 
55 

Arg Glu Ala Arg 
70 

Lys Pro Asp Gly 
85 



Thr Thr Phe Gin 
25 

Ser Lys Leu Gin 



Leu Arg Glu Leu 
60 

Pro Ala Leu Leu 
75 

Leu Arg Pro lie 
90 



Lys Asn Arg Leu 
30 

Ser He Gly He 
4 5 

Ser Glu Ala Glu 



Thr Ser Arc Leu 
80 

Val Asn Met Asp 
95 



29 



Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg 
100 105 110 

Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg 
115 120 125 

Ala 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 233 

(D) OTHER INFORMATION: /note= "TRT motifs from 

Schizosaccharomyces pombe tezl" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

lie Ser Glu He Glu Trp Leu Val Leu Gly Lys Arg Ser Asn Ala Lys 
15 10 15 

Met Cys Leu Ser Asp Phe Glu Lys Arg Lys Gin lie Phe Ala Glu Phe 
20 ' 25 30 

He Tyr Trp Leu Tyr Asn Ser Phe He He Pro He Leu Gin Ser Phe 
35 40 45 

Phe Tyr He Thr Glu Ser Ser Asp Leu Arg Asn Arg Thr Val Tyr Phe 
50 55 60 

Arg Lys Asp He Trp Lys Leu Leu Cys Arg Pro Phe He Thr Ser Met 
65 ' 70 75 80 

Lys Met Glu Ala Phe Glu Lys He Asn Glu Asn Asn Val Arg Met Asp 
85 90 95 

Thr Gin Lys Thr Thr Leu Pro Pro Ala Val He Arg Leu Leu Pro Lys 
100 105 HO 

Lys Asn Thr Phe Arg Leu He Thr Asn Leu Arg Lys Arg Phe Leu He 
115 120 125 

Lys Met Gly Ser Asn Lys Lys Met Leu Val Ser Thr Asn Gin Thr Leu 
130 135 140 

Arg Pro Val Ala Ser He Leu Lys His Leu He Asn Glu Glu Ser Ser 
145 150 155 160 



Gly He Pro Phe Asn Leu Glu Val Tyr Met Lys Leu Leu Thr 
165 170 



Phe Lys 
17 5 



30 



Lys Asp Leu Leu Lys His Arg Met Phe Gly Arg Lys Lys Tyr Phe Val 
180 185 190 

Arg lie Asp lie Lys Ser Cys Tyr Asp Arg lie Lys Gin Asp Leu Met 
195 200 205 

Phe Arg lie Val Lys Lys Lys Leu Lys Asp Pro Glu Phe Val lie Arg 
210 215 220 

Lys Tyr Ala Thr He His Ala Thr Ser 

225 230 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(i>:) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1..233 

(D) OTHER INFORMATION; /note= "TRT motifs from Sacchaxomyces 

cerevisiae EST2" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 5 : 

Leu lys Asp Phe Arg Trp Leu Phe He Ser Asp He Trp Phe Thr Lys 
1 5 10 15 

His Asn Phe Glu Asn Leu Asn Gin Leu Ala lie Cys Phe He Ser Trp 
20 25 30 

Leu Phe Arg Gin Leu lie Pro Lys He lie Gin Thr Phe Phe Tyr Cys 
3 5 4 0 4 5 

Thr Glu He Ser Ser Thr Val Thr lie Val Tyr Phe Arg His Asp Thr 
50 55 60 

Trp Asn Lys Leu He Thr Pro Phe lie Val Glu Tyr Phe Lys Thr Tyr 
65 70 75 30 

Leu Val Glu Asn Asn Val Cys Arg Asn His Asn Ser Tyr Thr Leu Ser 
85 90 95 

Asn Phe Asn His Ser Lys Met Arg lie lie Pro Lys Lys Ser Asr. Asn 
100 105 110 

Glu Phe Arg lie He Ala He Pro Cys Arg Gly Ala Asp Glu Glu Glu 
115 120 ~ 125 

Phe Thr lie Tyr Lys Glu Asn His Lys Asn Ala lie Gin Pro Thr Gin 
130 135 140 




Lys He Leu Glu Tyr Leu Arg Asn Lys Arg Pro Thr Ser Phe Thr Lys 
145 150 155 160 

He Tyr Ser Pro Thr Gin He Ala Asp Arg He Lys Glu Phe Lys Gin 
165 170 175 

Arg Leu Leu Lys Lys Phe Asn Asn Val Leu Pro Glu Leu Tyr Phe Met 
180 185 190 



Lys Phe Asp Val Lys Ser Cys Tyr Asp Ser He Pro Arg Met Glu Cys 
195 200 205 

Met Arg He Leu Lys Asp Ala Leu Lys Asn Glu Asn Gly Phe Phe Val 
210 215 220 

Arg Ser Gin Tyr Phe Phe Asn Thr Asn 
225 230 



INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: peptide 



(ix) FEATURE : 

(A) NAME /KEY : Peptide 

(B) LOCATION : 1 . . 233 

(D) OTHER INFORMATION: /note- M TRT motifs from Euplotes 

aedicuiatus pi 23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO : 1 6 : 

Thr Ara Glu He Ser Trp Met Gin Val Glu Thr Ser Ala Lys His Phe 

1 ^ 5 10 15 

Tvr Tyr Phe Asp His Glu Asn He Tyr Val Leu Trp Lys Leu Leu Arg 

20 25 30 

Trp lie Phe Glu Asp Leu Val Val Ser Leu He Arg Cys Phe Phe Tyr 
35 4 0 45 

Val Thr Glu Gin Gin Lys Ser Tyr Ser Lys Thr Tyr Tyr Tyr Arg Lys 
50 55 60 



Asn He Tro Asp Val He Met Lvs Met Ser lie Ala Asp Leu Lys Lys 
65 70 75 80 

Glu Thr Leu Ala Glu Val Gin Glu Lys Glu Val Glu Glu Trp Lys Lys 

85 90 95 



Ser Leu Gly Phe Ala Pro Gly Lys Leu Arg Leu -He Pro Lys Lys Thr 
100 105 110 



32 ^ 



Thr Phe Arg Pro lie Met Thr Phe Asa Lys Lys lie Val Asn Ser Asp 
115 120 125 

Arg Lys Thr Thr Lys Leu Thr Thr Asn Thr Lys Leu Leu Asn Ser His 
130 135 140 

Leu Met Leu Lys Thr Leu Lys Asn Arg Met Phe Lys Asp Pro Phe Gly 
145 150 155 160 

Phe Ala Val Phe Asn Tyr Asp Asp Val Met Lys Lys Tyr Glu -Glu Phe 
165 170 175 

Val Cys Lys Trp Lys Gin Val Gly Gin Pro Lys Leu Phe Phe Ala Thr 
180 185 190 

Met Asp lie Glu Lys Cys Tyr Asp Ser Val Asn Arg Glu Lys Leu Ser 
195 ■ 200 205 

Thr Phe Leu Lys Thr Thr Lys Leu Leu Ser Ser Asp Phe Trp He Met 
210 215 220 

Thr Ala Gin He Leu Lys Arg Lys Asn 
225 230 



INFORMATION FOR SEQ ID MO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

{ii} MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 4 

(D) OTHER INFORMATION: /note= "consensus telomerase Rv sequence 

from motif T " 

(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /product- "OTHER" 

/note= "Xaa = polar amino acid, Gly, 
Ser, Thr, Tyr, Cys, Asn or Gin" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 17: 



Xaa Phe Phe Tyr 
1 




(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..7 

(D) OTHER INFORMATION : /note- "consensus telomerase RT sequence 

from motif 1" 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-s i te 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /product- "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, Trp 
or Met" 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-s i t e 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /product- "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, Trp 
or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Xaa Arg Xaa He Pro Lys Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO : 1 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 4 

(D) OTHER INFORMATION : /note= "consensus telomerase RT sequence 

from motif 2" 




(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Phe Arg Xaa He 
1 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 
(3) LOCATION: 1..6 

(D) OTHER INFORMATION: /note- "consensus telomerase RT sequence 

from motif A" 

(ix) FEATURE: 

( A ) NAME/ KEY : Modi fied-site 
(3) LOCATION: 2 

(D) OTHER INFORMATION: /product- "OTHER" 

/note- "Xaa = charged amino acid, Asp, 
Glu, His, Lys or Arg" 

(ix) FEATURE : 

(A) NAME/ KEY : Modi fied-site 
(3) LOCATION: 6 

(D) OTHER INFORMATION: /product- "OTHER" 

/note-' "Xaa = hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, Trp 
or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 20: 

Pro Xaa Leu Tyr Phe Xaa 
1 5 - 



(2) INFORMATION FOR SEQ ID NO : 2 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..6 

(D) OTHER INFORMATION: /note= "consensus teiomerase RT sequence 

from motif B' " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

Gly He Pro Gin Gly Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDMESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1..4 

(D) OTHER INFORMATION: /note= "consensus teiomerase RT sequence 

from motif C" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Leu Leu Arc Leu 
1 



(2) INFORMATION FOR SEQ ID NO: 23: 

(l) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDMESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1. .7 

(D) OTHER INFORMATION: /note- "consensus teiomerase RT sequence 

from motif C" 
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(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-si te 

(B) -LOCATION: 5 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, Trp 
or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Asp Asp Phe Leu Xaa He Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO : 24 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) . TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



( ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..4 8 

(D) OTHER INFORMATION: /note= "motif T peptide from 

Schizosaccharqmyces pombe TRT tezlp" 



(xl) SEQUENCE DESCRIPTION: SEQ ID MO: 24: 

Trp Leu Tyr Asn Ser Phe He lie Pro He Leu Gin Ser Phe Phe Tyr 

1 5 10 15 

He Thr Glu Ser Ser Asp Leu Arg Asn Arg Thr Val Tyr Phe Arg Lys 

20 " 25 30 

Asp lie Trp Lys Leu Leu Cys Arg Pro Phe He Thr Ser Met Lys Met 

35." 4 0 4 5 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 54 

(D) OTHER INFORMATION: /note= "motif 1 and 2 peptide from 

Schi zosaccharomyces pombe TRT tezlp" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

Asn Asn Val Arg Met Asp Thr Gin Lys Thr Thr Leu Pro Pro Ala' Val 
15 10 15 

lie Arg Leu Leu Pro Lys Lys Asn Thr Phe Arg Leu He Thr Asn Leu 
20 25 30 

Arg Lys Arg Phe Leu He Lys Met Gly Ser Asn Lys Lys Met Leu Val 
35 40 45 

Ser Thr Asn Gin Thr Leu 
50 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION: 1 . . 34 

(D) OTHER INFORMATION: /note= "motif A peptide from 

Schi zosaccharomyces pombe TRT tezlp" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 26: 

Phe Glv Arg Lys Lvs Tyr Phe Val Arg He Asp He Lys Ser Cys Tyr 
1*5* 10 15 

Aso Arg He Lys Gin Asp Leu Met Phe Arg He Val Lys Lys Lys Leu 
20 25 30 

Lys Asp 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(A) NAME /KEY : Peptide 

(B) LOCATION: 1.-35 

(D) OTHER INFORMATION: /note- "motif B" peptide from 

Schizosaccharomyces pombe TRT tezlp" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Ser Gin Tyr Leu Gin Lys Val Gly He Pro Gin Gly Ser He Leu Ser 
15 10 15 

Ser Phe Leu Cys His Phe Tyr Met Glu Asp Leu He Asp Glu Tyr Leu 
20 25 30 

Ser Phe Thr 
35 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..43 

(D) OTHER INFORMATION: /note= "motif C and D peptide from 

Schizosaccharomyces pombe TRT tezlp' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 



Leu Leu Arg Val Val Asp Asp Phe Leu Phe He Thr Val Asn Lys Lys 
15 10 15 

Asp Ala Lys Lys Phe Leu Asn Leu Ser Leu Arg Gly Phe Glu Lys His 
20 25 30 

Asn Phe Ser Thr Ser Leu Glu Lys Thr Val lie 
35 40 



(2) INFORMATION FOR SEQ ID NO : 2 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION: 1 . . 11 

(D) OTHER INFORMATION: /note- "motif E peptide from 

Schizosaccharomyces pombe TRT tezlp" 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 29: 

Lys Lys Arg Met Pro Phe Phe Gly Phe Ser Val 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..4 8 

(D) OTHER INFORMATION: /note= "motif T peptide from human TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr 
1 S 10 15 

Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr Arg Lys 
20 25 30 

Ser Val Trp Ser Lys Leu Gin Ser He Gly He Arg Gin His Leu Lys 
35' 40 45 



[2] INFORMATION FOR SEQ ID MO : 3 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 54 

(D) OTHER INFORMATION: /note= "motif 1 and 2 peptide from 

human TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:31: 

Glu Val Arg Gin His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg 
15 10 15 



Leu Arg Phe He Pro Lys Pro Asp Gly Leu Arg Pro lie Val Asn Met 
20 25 30 
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Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu 
35 40 45 

Arg Leu Thr Ser Arg Val 
50 



(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

- (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 34 

(D) OTHER INFORMATION: /note- "motif A peptide from human TRT " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr 
15 10 15 

Asp Thr lie Pro Gin Asp Arg Leu Thr Glu Val He Ala Ser He He 
20 25 30 

Lys Pro 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 
(3) TYPE : amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1.-35 

(D) OTHER INFORMATION: /note= "motif 3' peptide from human TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

Lys Ser Tyr Val Gin Cys Gin Gly lie Pro Gin Gly Ser He Leu Ser 
l' 5 10 15 

Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe 
20 25 30 
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Ala Gly He 
35 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. . 43 

(D) OTHER INFORMATION: /note= "motif C and D peptide from 

human TRT" 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu .Thr 

1 5 10 15 

His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr 

20 25 30 

Gly Cys Val Val Asn Leu Arg Lys Thr Val Val 
35 40 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 11 

(D) OTHER INFORMATION: /note= "motif E peptide from human TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



His Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu 
1 ' 5 10 
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(2) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .48 

(D) OTHER INFORMATION: /note= "motif T peptide from Euplotes 

aediculatus p!23" 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Trp He Phe Glu Asp Leu Val Val Ser Leu He Arg Cys Phe Phe Tyr 

15 10 15 

Val Thr Glu Gin Gin Lys Ser Tyr Ser Lys Thr Tyr Tyr Tyr Arg Lys 

20 25 30 

Asn He Trp Asp Val He Met Lys Met Ser He Ala Asp Leu Lys Lys 

35 40 45 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE : 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 54 

(D) OTHER INFORMATION: /note= "motif 1 and 2 peptide from 

Euplotes aediculatus p!23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Lys Glu Val Glu Glu Trp Lys Lys Ser Leu Gly Phe Ala Pro Gly Lys 
1 5 10 15 

Leu Arg Leu He Pro Lys Lys Thr Thr Phe Arg Pro lie Met Thr Phe 
20 25 30 

Asn Lys Lys lie Val Asn Ser Asp Arg Lys Thr Thr Lys Leu Thr Thr 
35 40 45 

Asn Thr Lys Leu Leu Asn 
50 




(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. . 34 

(D) OTHER INFORMATION: /note- "motif A peptide from Euplotes 

aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Gly Gin Pro Lys Leu Phe Phe Ala Thr Met Asp He Glu Lys Cys Tyr ' 
1 5 10 15 

Asp Ser Val Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys Thr Thr Lys 
20 25 30 

Leu Leu 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



'(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1..35 

(D) OTHER INFORMATION: /note= "motif B' peptide from Euplotes 

aediculatus p!23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Lys Phe Tyr Lys Gin Thr Lys Gly lie Pro Gin Gly Leu Cys Val Ser 
15 10 15 

Ser He Leu Ser Ser Phe Tyr Tyr Ala Thr Leu Glu Glu Ser Ser Leu 
20 25 30 



Gly Phe Leu 
35 



4.4 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..43 

(D) OTHER INFORMATION: /note= "motif C and D peptide from 

Euplotes aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 0 : 

Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu He Thr Thr Gin Glu Asn 
15 10 15 

Asn Ala Val Leu Phe He Glu Lys Leu He Asn Val Ser Arg Glu Asn 
20 25 30 

Gly Phe Lys Phe Asn Met Lys Lys Leu Gin Thr 
35 40 



(2) INFORMATION FOR SEQ ID NO : 4 1 : 

(i) SEQUENCE CHARACTERI ST ICS : 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1 . . 11 

(D) OTHER INFORMATION: /note= "motif E peptide from Euplotes 

aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 1 : 

Gin Asd Tyr Cys Asp Trp He Gly He Ser He 
1*5 10 



(2) INFORMATION FOR SEQ ID NO : 4 2 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 1 
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(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

{A) NAME/KEY : Peptide 
(B) LOCATION: 1..47 

(D) OTHER INFORMATION: /note* "motif T peptide from 

Saccharomyces cerevisiae EST2p" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Trp Leu Phe Arg Gin Leu He Pro Lys He He Gin Thr Phe Phe Tyr 

15 10 15 

Cys Thr Glu He Ser Ser Thr Val Thr He Val Tyr Phe Arg His Asp 
20 25 30 

Thr Trp Asn Lys Leu He Thr Pro Phe He Val Glu Tyr Phe Lys 
35 4.0 4 5 



(2} INFORMATION FOR SEQ ID NO : 4 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 26 

(D) OTHER INFORMATION: /note= "motif 1 peptide from 

Saccharomyces cerevisiae EST2p" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 3 : 

Cys Arg Asn His Asn Ser Tyr Thr Leu Ser Asn Phe Asn His Ser Lys 
15 10 15 

Met Ara He He Pro Lys Lys Ser Asn Asn 
20 25 



(2) INFORMATION FOR SEQ ID NO : 4 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE: 

(A) NAME/KEY : Peptide 

(B) LOCATION: 1..29 

(D) OTHER INFORMATION: /note= "motif 2 peptide from 

Saccharomyces cerevisiae EST2p" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Phe Arg He He Ala He Pro Cys Arg Gly Ala Asp Glu Glu Glu Phe 
1 5 10 15 

Thr He Tyr Lys Glu Asn His Lys Asn* Ala He Gin Pro 
20 25 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION : 1 . . 34 

(D) OTHER INFORMATION: /note= "motif A peptide from 

Saccharomyces cerevisiae EST2p" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Val Leu Pro Glu Leu Tyr Phe Met Lys Phe Asp Val Lys Ser Cys Tyr 
1 5 10 15 

Asp Ser He Pro Arg Met Glu Cys Met Arg He Leu Lys Asp Ala Leu 
20 25 30 

Lys Asn 



(2) INFORMATION FOR SEQ ID MO : 4 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 35 

(D) OTHER INFORMATION : /note= "motif B 1 peptide from 

Saccharomyces cerevisiae EST2p" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Lys Cys Tyr He Arg Glu Asp Gly Leu Phe Gin Gly Ser Ser Leu Ser 
15 10 15 

Ala Pro He Val Asp Leu Val Tyr Asp Asp Leu Leu Glu Phe Tyr Ser 
20 25 30 

Glu Phe Lys 
35 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1 . . 43 

(D) OTHER INFORMATION: /note= "motif C and D peptide from 

Saccharomyces cerevisiae EST2p" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

He Leu Lys Leu Ala Aso Asp Phe Leu He He Ser Thr Asp Gin Gin 

1 5 ' ' 10 15 

Gin Val lie Asn He Lys Lys Leu. Ala Met. Gly Gly Phe Gin Lys Tyr 
20 25 30 

Asn Ala Lys Ala Asn Arg Asp Lys He Leu Ala 
35 40 



(2) INFORMATION FOR SEQ ID MO : 4 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 11 

(D) OTHER INFORMATION: /note= "motif E peptide from 

Saccharomyces cerevisiae EST2p" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Lys Glu Leu Glu Val Trp Lys His Ser Ser Thr 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 49: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..4 

(D) OTHER INFORMATION: /note= "consensus non- t elomerase RT 

sequence from motif B'" 



(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 9 : 

Xaa Pro Gin Gly 
1 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1.-5 

(D) OTHER INFORMATION: /note= "consensus non-telomerase RT 

sequence from motif C" 
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(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, Trp 
or Met" 

(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /product= "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 

(ix) FEATURE: , 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /product= "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, Trp 
or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Asp Asp Xaa Xaa Xaa 
1 5 



(2) INFORMATION FOR SEQ ID NO: 51: 

(!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDMESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 55 

(D) OTHER INFORMATION: /note- "motif 1 and 2 peptide from 

Saccharomyces cerevisiae cytochrome 
oxidase group II intron 1-encoded 
mitochondrial protein" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 1 : 

Leu Ser Asn Glu Leu Gly Thr Gly Lys Phe Lys Phe Lys Pro Met Arg 
15 10 15 

He Val Asn He Pro Lys Pro Lys Gly Gly He Arg Pro Leu Ser Val 
20 25 30 



Gly Asn Pro Arg Asp Lys He Val Gin Glu 
35 40 



Val 



Met Arg 
45 



Met He Leu 



Asp Thr lie Phe Asp Lys Lys 
50 55 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 34 

(D) OTHER INFORMATION: /note- "motif A peptide from 

Saccharomyces cerevisiae cytochrome 
oxidase group II intron 1-encoded 
mitochondrial protein" 



(>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Phe Gly Gly Ser Asn Trp Phe lie Glu Val Asp Leu Lys Lys Cys Phe 
1 5 10 15 

Asp Thr lie Ser His Asp Leu He lie Lys Glu Leu Lys Arg Tyr He 
20 ' 25 30 

Ser Aso 



INFORMATION FOB. SEQ ID MO : 5 2 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 35 amino acids 
(3) TYPE: amino acid 
CO STRANDEDNESS : 
(Dj TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION : 1 . . 35 

(D) OTHER INFORMATION: /note- "motif B' peptide from 

Saccharomyces cerevisiae cytochrome 
oxidase group II intron 1-encoded 
mitochondrial protein" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



Thr Tyr His Lys Pro Met Leu Gly Leu Pro Gin Gly Ser Leu He Ser 
15 10 15 
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Pro lie Leu Cys Asn lie Val Met Thr Leu Val Asp Asn Trp Leu Glu 
20 25 30 

Asp Tyr lie 
35 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. . 17 

(D) OTHER INFORMATION: /note= "motif C peptide from 

Saccharomyces cerevisiae cytochrome 
oxidase group II intron 1-encoded 
mitochondrial protein" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Tyr Val Arg Tyr Ala Asp Asp lie Leu lie Gly Val Leu Gly Ser Lys 
1 5 ^ 10 15 

Asn 



INFORMATION FOR SEQ ID NO : 5 d : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE : 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 2 5 

(D) OTHER INFORMATION: /note= "motif D peptide from 

Saccharomyces cerevisiae cytochrome 
oxidase group II intron 1-encoded 
mitochondrial protein" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:55: 

Lys Met lie Lys Arg Asp Leu Asn Asn Phe Leu Asn Ser Leu Gly Leu 
1-5 10 15 




Thr He Asn Glu Glu Lys Thr Leu He 
20 25 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION: 1..11 

(D) OTHER INFORMATION: /note= "motif E peptide from 

Saccharomyces cerevisiae cytochrome 
oxidase group II intron 1-encoded 
mitochondrial protein" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 56: 

Glu Thr Pro Ala Arg Phe Leu Gly Tyr Asn He 
1 5 10 



(2} INFORMATION FOR SEQ ID NO: 57: 

(1} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 
(3) TYPE: amino acid. 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 26 

(D) OTHER INFORMATION: /note= "motif 1 peptide from 

Drosophila melanogaster TART non-LTR 
retrotransposable element reverse 
transcriptase " 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 57: 

Ser He Leu Arg He Gly Tyr Tyr Pro Asp Ala Trp Lys His Ala Gin 
15 10 15 



Val Lys Met He Leu Lys Pro Gly Lys Ser 
20 25 
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(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .29 

{ D) OTHER INFORMATION: /note= "motif 2 peptide from 

Drosophila melanogaster TART non-LTR 
retrotransposable element reverse 
transcriptase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Tyr Arg Pro lie Ser Leu Leu Ser Gly Leu Ser Lys Met Phe Glu Arg 
15 10 15 

Leu Leu Leu Lys Arg Leu ' Phe Arg Val Asp Leu Phe Lys 
20 25 



{2} INFORMATION FOR SEQ ID MO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY : linear 

ill) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 34 

(D) OTHER INFORMATION: /note= "motif A. peptide from 

Drosophila melanogaster TART non-LTR 
retrotransposable element reverse 
transcriptase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 59: 

Arg Lys Glu Tyr Cys Ser Ala Val ?he Leu Asp He Ser Glu Ala Phe 
15 10 15 

Asp Arg Val Trp His Glu Gly Leu Leu Leu Lys Leu Ala Lys lie Leu 
20 25 30 



Pro Tyr 




(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 35 
<D) OTHER INFORMATION 



: /note= "motif B' peptide from 

Drosophila melanogaster TART non-LTR 
retrotransposable element reverse 
transcriptase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Arg Ala Gly Gin He Gly Ala Gly Val Pro Gin Gly Ser Asn Leu Gly 
1 5 10 15 

Pro lie Leu Tyr Ser He Phe Ser Ser Asp Met Pro Leu Pro His He 
20 25 30 

Tyr His Pro 
35 



.2) INFORMATION FOR SEQ ID MO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY : Peptide 

(B) LOCATION: 1 . . 17 

(D) OTHER INFORMATION: /note- "motif C peptide from 

Drosophila melanogaster TART non-LTR 
retrotransposable element reverse 
transcriptase " 



[xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Leu Ser Thr Tyr Ala Asp Asp Thr He Val Leu Ser Ser Asp He Leu 
I 5 10 15 



Ala 
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(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..26 

(D) OTHER INFORMATION: /note= "motif D peptide from 

Drosophila melanogaster TART non-LTR 
retrotransposable element reverse 
transcriptase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 



Asn Glu Asn Tyr Leu Lys Thr Phe Ser Asp Trp Ala Asp Lys Trp Gly 
1 5 10 15* 

He Ser Val Asn Ala Ala Lys Thr Gly His 
20 25 



(.2) INFORMATION FOR SEQ ID NO; 63: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

\D) TOPOLOGY: linear 

(ii) XOLECULE TYPE: peptide 



(ix) FEATURE : 

(A) NAME/KEY: Peptide 
(3) LOCATION: 1 . . 11 

{ D) OTHER INFORMATION: /note= "motif E peptide from 

Drosophila melanogaster TART non-LTR 
retrotransposable element reverse 
transcriptase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 63: 



Glu Ser Lys Gin Ser Tyr Leu Gly Val He Leu 
1 5 10 
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(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
.(B) LOCATION: 1..26 

(D) OTHER INFORMATION : /note= "motif 1 peptide from HIV-1 

reverse transcriptase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Glu Gly Lys He Ser Lys He Gly Pro Glu Asn Pro Tyr Asn Thr Pro 
15 10 15 

Val Phe Ala He Lys Lys Lys Asp Ser Thr 
20 25 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

!u) MOLECULE TYPE: peptide 



{!:-:) FEATURE: 

(A) NAME /KEY: Peptide 
(3) LOCATION: 1,-63 

(D) OTHER INFORMATION: /note- "motif 2 and A peptide from HIV-1 

reverse transcriptase" 



(xi) 'SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

Trp Arg Lys Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gin Asp 

1 5 10 15 

Phe Tro Glu Val Gin Leu Gly He Pro His Pro Ala Gly Leu Lys Lys 
20 25 30 

Lys Lys Ser Val Thr Vai Leu Asp Val Gly Asp Ala Tyr Phe Ser Val 
35 4 0 4 5 



Pro Leu Asp Glu Asp Phe Arg Lys Tyr Thr Ala Phe Thr He Pro 
50 55 60 




(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 35 

(D) OTHER INFORMATION: /note= "motif B 1 peptide from HIV-1 

reverse transcriptase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 

Gly lie Arg Tyr Gin Tyr Asn Val Leu Pro Gin Gly Trp Lys Gly Ser 
15 10 15 

Pro Ala He Phe Gin Ser Ser Met Thr Lys He Leu Glu Pro Phe Lys 
20 25 30 

Lys Gin Asn 
35 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(3) TYPE: amino acid 

(C) STRANDEDNESS : 

{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION: 1 . . 17 

(D) OTHER INFORMATION: /note= "motif C peptide from HIV-1 

reverse transcriptase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 67: 

He Tyr Gin Tyr Met Asp Asp Leu Tyr Val Gly Ser Asp Leu Glu He 
1 ' 5 10 15 



Gly 
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(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

{A) NAME/KEY: Peptide 
' (B) LOCATION: 1 . . 37 

(D) OTHER INFORMATION: /note= "motif D and E peptide from HIV-1 

reverse transcriptase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

His A^a Thr Lys lie Glu Glu Leu Arg Gin His Leu Leu Arg Trp Gly 

1 ^ 5 10 15 

Leu Thr Thr Pro Asp Lys Lys His Gin Lys Glu Pro Pro Phe Leu Trp 
20 25 30 

Met Gly He Thr Leu 
35 



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS : 
{ D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION: 1 . . 4 

(D) OTHER INFORMATION: /note- "consensus telomerase RT finger 

sequence from motif 1" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

He Pro Lys Lys 
1 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 5 

(D) OTHER INFORMATION: /note= "consensus telomerase RT palm, 

primer grip sequence from motif C" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Leu Leu Leu Arg Leu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
■(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1 . . 4 

(D) OTHER INFORMATION: /note= "consensus telomerase RT palm, 

primer grip sequence from motif C" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 1 : 

Asp Asp Phe Leu 
I 



(2) INFORMATION FOR SEQ ID MO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 43 

(D) OTHER INFORMATION: /note= "telomerase specific motif T 

peptide from human TRT " 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: 

Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr 
1 5 10 15 

Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr Arg Lys 
20 25 30 

Ser Val Trp Ser Lys Leu Gin Ser He Gly He 
35 40 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

( B ) LOCATION: 1 . . 5 

(D) OTHER INFORMATION: /note= "telomerase specific motif T' 

peptide from human TRT" 



(>:i) SEQUENCE DESCRIPTION: SEQ ID MO:73: 

Glu Ala Glu Val Arg 
1 5 



(2) IN FORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 16 

(D) OTHER INFORMATION: /note= "telomerase RT finger motif 

1 and 2 peptide from human TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:74: 



Ser Arg Leu Arg Phe lie Pro Lys Pro Asp Gly Leu Arg Pro lie Val 
1 5 10 " ' 15 
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(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) " LOCATION: 1..17 

(D) OTHER INFORMATION: /note= "telomerase RT finger motif A 

peptide from human TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 

Pro Glu Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr 
15 10 15 

lie 



(2) INFORMATION FOR SEQ ID NO : 7 6 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY: Peptide 

(B) LOCATION: 1 . . 22 

(D) OTHER INFORMATION: /note= "telomerase RT finger mccif 

oeotide from human TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 6 : 

Tvr Val Gin Cys Gin Gly He Pro Gin Gly Ser He Leu Ser Thr Leu 
I 5 10 15 

Leu Cys Ser Leu Cys Tyr 
20 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY : Peptide 

(B) LOCATION: 1..13 

(D) OTHER INFORMATION: /note= "telomerase RT palm, primer grip 

motif C peptide from human TRT " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: 

Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr 
1 5 10 



(2) INFORMATION FOR SEQ ID NO:78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



{ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 16 

(D) OTHER INFORMATION: /note- "telomerase RT palm, primer grip 

motif D peptide from human TRT" 



(>:i) SEQUENCE DESCRIPTION: SEQ ID NO:78: 

Giy Va 1 Pro Glu Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val 
1 5 10 ■ 15 



(2) INFORMATION FOR SEQ ID NO : 7 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 
- (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION: 1. . 11 

(D) OTHER INFORMATION: /note= "telomerase RT palm, primer grip 

motif E peptide from human TRT" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79: 

Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu 
15 10 



(2) INFORMATION FOR SEQ ID NO:80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .43 

(D) OTHER INFORMATION: /note= "telomerase specific motif T 

peptide from Schizosaccharomyces 
pombe TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:80:. 

Tro Leu Tyr Asn Ser Phe lie He Pro He Leu Gin Ser Phe Phe Tyr 

1 5 10 15 

He Thr Glu Ser Ser Asp Leu Arg Asn Arg Thr Val Tyr Phe Arg Lys 
20 25 30 

Asp He Trp Lys Leu Leu Cys Arg Pro Phe He 
35' 40 



INFORMATION FOR SEQ ID NO : 8 1 : 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 5 

(D) OTHER INFORMATION: /note- "telomerase specific motif T' 

peptide from Schizosaccharomyces 
oombe TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 1 : 



Glu Asn Asn Val Arg 
1 5 



(2) INFORMATION FOR SEQ ID NO: 82: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
(8) LOCATION: 1..16 

(D) OTHER INFORMATION: /note= "telomerase RT finger motif 

1 and 2 peptide from Schizosaccharomyces 
pombe TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82: 

Ala Val lie Arg Leu Leu Pro Lys Lys Asn Thr Phe Arg Leu lie Thr 
15 10 15 



(2/ INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 17 amino acids 
(3) TYPE : amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

{A) NAME /KEY : Peptide 
{3) LOCATION: 1 . . 17 

( D) OTHER INFORMATION : /note= "telomerase RT finger motif A 

peptide from Schizosaccharomyces 
pombe TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

Arg Lys Lys Tyr Phe Val Arg lie Asp lie Lys Ser Cys Tyr Asp Arg 
1 5 10 15* 

lie 



(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION; 1 . .22 

(D) OTHER INFORMATION: /note- "telomerase RT finger motif B' 

peptide from Sehizosaccharomyces 
pombe TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84: 

Tyr Leu Gin Lys Val Gly He Pro Gin Gly Ser He Leu Ser Ser Phe 
1 5 10 15 

Leu Cys His Phe Tyr Met 
20 



(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
CO STRANDEDNESS: 

(D) TOPOLOGY: linear 

MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3} LOCATION: 1 . . 13 

(D) OTHER INFORMATION: /note- "telomerase RT palm, primer grip 

motif C peptide from Sehizosaccharomyces 
pombe TRT" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 5 : 

Val Leu Leu Arg Val Val Asp Asp Phe Leu Phe lie Thr 
1 5 10 



(2) IN FORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



{!:■:) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION: 1 . . 16 
(D) OTHER INFORMATION 



/note= "telomerase RT palm, primer grip 
motif D peptide from Sehizosaccharomyces 
pombe TRT" 



66 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:86: 

Gly Phe Glu Lys His Asn Phe Ser Thr Ser Leu Giu Lys Thr Val He 
1.5 10 15 



(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



lix) FEATURE: 

(A) NAME/ KEY : Peptide 
(3) LOCATION: 1..11 

( D) OTHER INFORMATION : /note- " telomerase RT palm, primer grip 

motif E peptide from Schi zosaccharomyces 
pombe TRT " 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 87: 

Phe Phe Gly Phe Ser Val Asn Met Arg Ser Leu 
1 5 10 



INFORMATION FOR SEQ ID MO: S3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION: 1 . . 43 

(D) OTHER INFORMATION: /note- "telomerase specific motif T 

peptide from Euplotes aediculatus pl23' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 8 : 

Tro Tie phe Glu Asp Leu Val Val Ser Leu He Arg Cys Phe Phe Tyr 
1 5 10 15 

Val Thr Glu Gin Gin Lys Ser Tyr Ser Lys Thr Tyr Tyr Tyr Arg Lys 
20 25 30 



Asn He Trp Asp Val He Met Lys Met Ser 
35 40 



He 




(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(.D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY:- Peptide 

(B) LOCATION: 1..5 

{D} OTHER INFORMATION: /note= "telomerase specific motif T 1 

peptide from Euplotes aediculatus p!23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Giu Lys Glu Val Glu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(A) NAME/KEY: Peptide 
(3) LOCATION: 1 . -.16 

(D) OTHER INFORMATION: /note- "telomerase RT finger motif 

1 and 2 peptide from Euplotes 
aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Gly Lys Leu Arg Leu lie Pro Lys Lys Thr Thr Phe Arg Pro He Met 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE: 

{A) NAME/KEY: Peptide 
(B) LOCATION: 1 . . 17 

(D) OTHER INFORMATION: /note= "telomerase RT finger motif A 

peptide from Euplotes aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Pro Lys Leu Phe Phe Ala Thr Met Asp He Glu Lys Cys Tyr Asp Ser 
1 5 10 15 

Val 



(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Peptide 
(3) LOCATION: 1 . . 22 

(D) OTHER INFORMATION: /note= "telomerase RT finger motif B' 

peptide from Euplotes aediculatus pl23' 



SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Tyr L-"'S Gin Thr Lys Gly He Pro Gin Gly Leu Cys Val Ser Ser II* 
1 ^ 5 10 15 

Leu Ser Ser Phe Tyr Tyr 
20 



(2) IN FORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(C} STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 13 

(D) OTHER INFORMATION: /note= "telomerase RT palm, primer grip 

motif C peptide from Euplotes 
aediculatus pl23" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu He Thr 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..16 

(D) OTHER INFORMATION: /note= "telomerase RT palm, primer grip 

motif D peptide from Euplotes 
aediculatus pi 2 3" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

Val Ser Arc Glu Asn Gly Phe Lys Phe Asn Met Lys Lys Leu Gin Thr 
1 ' ' 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1 . . 11 

(D) OTHER INFORMATION: /note= "telomerase RT palm, primer grip 

motif E peptide from Euplotes 
aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

Trp He Gly He Ser He Asp Met Lys Thr Leu 
1 5 10 
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(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii)' MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..42 

(D) OTHER INFORMATION: /note= " telomerase specific motif T 

peptide from Saccharomyces 
cerevisiae EST2 " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

Trp Leu Phe Arg Gin Leu He Pro Lys He He Gin Thr Phe Phe Tyr 

1 5 10 15 

Cys Thr Glu He Ser Ser Thr Val Thr He Val Tyr Phe Arg His Asp 
20 25 30 

Thr Trp Asn Lys Leu He Thr Pro Phe He 
35 40 



(2; INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH; 5 amino acids 
(2) TYPE: amino acid 
(C) STRANDEDNESS: 
{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Peptide 
(3) LOCATION: 1 . . 5 

(D) OTHER INFORMATION: /note= "telomerase specific motif T* 

peptide from Saccharomyces 
cerevisiae EST2" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 97: 

Glu Asn Asn Val Cys 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 11 

(D) OTHER INFORMATION: /note= "telomerase RT finger motif 1 

peptide from Saccharomyces 
cerevisiae EST2 " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

Ser Lys Met Arg He He Pro Lys Lys Ser Asn 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. . 5 

(D) OTHER INFORMATION: /note= "telomerase RT finger mocif 2 

peptide from Saccharomyces 
cerevisiae EST2" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

Phe Arg He He Ala 
1 5 



(2) INFORMATION FOR SEQ ID MO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 17 

(D) OTHER INFORMATION: /note- "telomerase RT finger motif A 

peptide from Saccharomyces 
cerevisiae EST2" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

Pro Glu Leu Tyr Phe Met Lys Phe Asp Val Lys Ser Cys Tyr Asp Ser 
1 5 10 15 

He 



(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 22 

(D) OTHER INFORMATION : /note= "telomerase RT finger motif B' 

peptide from Saccharomyces 
cerevisiae EST2" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

T y r lie A r g G i u Asp G 1 y Lei: Phe Gin G 1 y Ser S e r Leu Ser Ala Pro 
1 5 10 15 

lie Val Asp Leu Val Tyr 
20 



(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid ( 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 13 

(D) OTHER INFORMATION: /note- "telomerase RT palm, primer grip 

motif C peptide from Saccharomyces 
cerevisiae EST2" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

Leu lie Leu Lys Leu Ala Asp Asp Phe Leu lie lie Ser 
15 10 



(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1. .16 

(D) OTHER INFORMATION: /note* "telomerase RT palm, primer grip 

motif D peptide from Saccha romyces 
cerevisiae EST2 " 



(:-:i) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

Gly Phe Gin Lys Tyr Asn Ala Lys Ala Asn Arg Asp Lys lie Leu Ala 
15 10 15 



12) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



( ix) FEATURE : 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 11 

(D) OTHER INFORMATION: /note- "telomerase RT palm, primer grip 

motif E peptide from Saccharomyces 
cerevisiae EST2 " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 



Trp Lys His Ser Ser Thr Met Asn Asn Phe His 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 105: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: protein_bind 

(B) LOCATION: 1 . . 10 

(D) OTHER INFORMATION: /note= "NFkappaB CS1 binding site motif 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

GGGRHTYYHC 10 



(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 
{A} LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : proteinjoind 
(3) LOCATION: 1 . . 11 

(D) OTHER INFORMATION : /note= "NFkappaB MHC 1,2 binding 

site no t i f " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
GGGCTTCCC C 11 



(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: pro te injoind 

(B) LOCATION: 1 . . 11 

(D) OTHER INFORMATION: /note= "NFkappaB CS2 binding site motif" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

RGGGRMTYYC C H 



(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: - 

(A) NAME /KEY : protein_bind 
(3) LOCATION: 1..18 

(D) OTHER INFORMATION: /note= " topoisomerase II cleavage 

site motif" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
RNYMMCNMGY MGKTMYNY 18 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3279 base pairs 
( 3 ) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DMA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(3) LOCATION: 101 . . 3196 

(D) OTHER INFORMATION: /note= "Euplotes aediculatus 123 kDa 

telomerase protein subunit (T?vT} M 
/codon= (seq: "tga", aa : Cys) 
/product^ "Euplotes aediculatus 123 kDa 
teloraerase protein subunit (TRT)" 

( xi ) SEQUENCE DESCRIPTION: SEQ ID MO: 109: 

AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT AACCTCAGTA 60 

TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA ATG GAG GTT GAT GTT 1-15 

Met Glu Val Asp Val 
1 5 



GAT AAT CAA GCT GAT AAT CAT GGC ATT CAC TCA GCT CTT AAG ACT TGT 
Asp Asn Gin Ala Asp Asn His Gly lie His Ser Ala Leu Lys Thr Cys 
10 15 20 



163 
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GAA GAA ATT AAA GAA GCT AAA ACG TTG TAC TCT TGG ATC CAG AAA GTT 211 
Glu Glu He Lys Glu Ala Lys Thr Leu Tyr Ser Trp He Gin Lys Val 
25 30 35 

ATT AGA TGA AGA AAT CAA TCT CAA AGT CAT TAT AAA GAT TTA GAA GAT 2 59 

He Arg Cys Arg Asn Gin Ser Gin Ser His Tyr Lys Asp Leu Glu Asp 
40 45 50 

ATT AAA ATA TTT GCG CAG ACA AAT ATT GTT GCT ACT CCA CGA GAC TAT 307 
He Lys He Phe Ala Gin Thr Asn He Val Ala Thr Pro Arg Asp Tyr 
55 60 65 

AAT GAA GAA GAT TTT AAA GTT ATT GCA AGA AAA GAA GTA TTT TCA ACT 355 
Asn Glu Glu Asp Phe Lys Val He Ala Arg Lys Glu Val Phe Ser Thr 
70 ' 75 80 85 

GGA'CTA ATG ATC GAA CTT ATT GAC AAA TGC TTA GTT GAA CTT CTT TCA 403 
Gly Leu Met He Glu Leu He Asp Lys Cys Leu Val Glu Leu Leu Ser 
90 95 100 

TCA AGC GAT GTT TCA GAT AGA CAA AAA CTT CAA TGA TTT GGA TTT CAA 4 51 

Ser Ser Asp Val Ser Asp Arg Gin Lys Leu Gin Cys Phe Gly Phe Gin 
105 110 115 

CTT AAG GGA AAT CAA TTA GCA AAG ACC CAT TTA TTA ACA GCT CTT TCA 4 99 

Leu Lys Glv Asn Gin Leu Ala Lys Thr His Leu Leu Thr Ala Leu Ser 
120 125 130 

ACT CAA .AAG CAG TAT TTC TTT CAA GAC GAA TGG AAC CAA GTT AGA GCA 54 7 

Thr Gin Lys Gin Tyr Phe Phe Gin Asp Glu Trp Asn Gin Val Arg Ala 
135 140 145 

ATG ATT GGA AAT GAG CTC TTC CGA CAT CTC TAC ACT AAA TAT TTA ATA 5 95 

Men lie Gly Asn Glu Leu Phe Arg His Leu Tyr Thr Lys Tyr Leu He 
150 ' 155 160 165 

TTC CAG CGA ACT TCT GAA GGA ACT CTT GTT CAA TTT TGC GGG AAT AAC 64 3 

Phe Gin Arg Thr Ser Glu Gly Thr Leu Val Gin Phe Cys Gly Asn Asn 
170 175 ' 180 

GTT TTT GAT CAT TTG AAA GTC AAC GAT AAG TTT GAC AAA AAG CAA AAA 691 
Val Phe Asp His Leu Lys Val Asn Asp Lys Phe Asp Lys Lys Gin Lys 
185 190 195 

GGT GGA GCA GCA GAC ATG AAT GAA CCT CGA TGT TGA TCA ACC TGC AAA 739 
Gly Gly Ala Ala Asp Met Asn Glu Pro Arg Cys Cys Ser Thr Cys Lys 
200 205 210 

TAC AAT GTC AAG AAT GAG AAA GAT CAC TTT CTC AAC AAC ATC AAC GTG 787 
Tyr Asn Val Lys Asn Glu Lys Asp His Phe Leu Asn Asn He Asn Val 
215 ' 220 225 

CCG AAT TGG AAT AAT ATG AAA TCA AGA ACC AGA ATA TTT TAT TGC ACT 835 
Pro Asn Trp Asn Asn Met Lys Ser Arg Thr Arg lie Phe Tyr Cys Thr 
230 - 235 240 245 

CAT TTT AAT AGA AAT AAC CAA TTC TTC AAA AAG CAT GAG TTT GTG AGT 88 3 

His Phe Asn Arg Asn Asn Gin Phe Phe Lys Lys His Glu Phe Val Ser 
250 255 260 



AAC AAA AAC AAT ATT TCA GCG ATG GAC AG A GCT CAG ACG ATA TTC ACG 
Asn Lys Asn Asn lie Ser Ala Met Asp Arg Ala Gin Thr lie Phe Thr 
265 270 275 



931 



AAT 
Asn 



ATA 
He 



TTC 
Phe 
280 



AGA TTT 
Arg Phe 



AAT AGA 
Asn Arg 



ATT AGA 
He Arg 
285 



AAG 
Lys 



AAG CTA 
Lys Leu 



AAA GAT AAG GTT 
Lys Asp Lys Val 
290 



979 



ATC 
He 



GAA 
Glu 

295 



AAA 
Lys 



ATT GCC 
He Ala 



TAC ATG 
Tyr Met 
300 



CTT GAG 
Leu Glu 



AAA 
Lys 



GTC AAA 
Val Lys 
305 



GAT TTT AAC TTC 
Asp Phe Asn Phe 



1027 



AAC 
Asn 
310 



TAC 
Tyr 



TAT 
Tyr 



TTA ACA 
Leu Thr 



AAA TCT 
Lys Ser 
315 



TGT CCT 
Cys Pro 



CTT 
Leu 



CCA GAA 
Pro Glu 
320 



AAT TGG 
Asn Trp 



CGG GAA 
Arg Glu 
325 
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CGG 
Arg 



AAA 
Lys 



CAA 
Gin 



AAA ATC 
Lys lie 
330 



GAA AAC 
Glu Asn 



TTG ATA 
Leu He 



AAT 
Asn 
335 



AAA ACT 
Lys Thr 



AGA GAA 
Arg Glu 



GAA AAG 
Glu Lys 
340 



1123 



TCG 

Ser 



AAG 
Lys 



TAC 
Tyr 



TAT GAA 
Tyr Glu 

345 



GAG CTG 
Glu Leu 



TTT AGC 
Phe Ser 
350 



TAC 
Tyr 



ACA ACT 
Thr Thr 



GAT AAT 
Asp Asn 
355 



AAA TGC 
Lys Cys 
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GTC 



ACA 
Th r 



Gin 
360 



TTT ATT 
Phe lie 



AAT GAA 
Asn Glu 



TTT TTC 
Phe Phe 
365 



TAC 

Tvr 



AAT ATA 
Asn lie 



CTC CCC 
Leu Pro 
370 



AAA GAC 
Lys Asp 



1219 



TTG 
Leu 
375 



ACT 
Th r 



GGA AGA 
Gly Arg 



AAC CGT. 
Asn Arg 
380 



AAG AAT 
Lys Asn 



TTT 
Phe 



CAA AAG 
Gin Lys 
385 



AAA GTT 
Lys Val 



AAG AAA 
Lys Lys 



1267 



l yr 
3 90 



GTG 
Val 



GAA 
Glu 



CTA AAC 
Leu Asn 



AAG CAT 
Lys His 
395 



GAA CTC 
Glu Leu 



ATT 



CAC AAA 
His Lys 
400 



AAC TTA TTG 
Asn Leu Leu 



CTT 
Leu 
4 05 



1315 



GAG AAG ATC 



i- V 3 



AAT ACA 
Asn Thr 
4 10 



AGA GAA 
Arg Glu 



ATA TCA 
He Ser 



TGG 
Tr? 
415 



ATG CAG 
Net Gin 



GTT 
Val 



GAG ACC 
Glu Thr 
420 



TCT 

Ser 



1363 



GCA 
Ala 



Lys 



CAT 
His 



TTT TAT 
Phe Tyr 
425 



TAT TTT 
Tyr Phe 



GAT CAC 
Asp His 
430 



GAA 
Glu 



AAC ATC 
Asn lie 



TAC 
Tyr 



GTC TTA 
Val Leu 
435 



TGG 
Tro 



1411 



AAA 
Lys 



TTG 
Leu 



CTC 
Leu 
4 40 



CGA TGG 
Arg Trp 



ATA TTC 
lie Phe 



GAG GAT 
Glu Asp 
445 



CTC 
Leu 



GTC GTC 
Val Val 



TCG 
Ser 
450 



CTG ATT 
Leu lie 



AGA 
Arg 



1459 



TGA 
Cys 



TTT 
Phe 
4 55 



TTC 
Phe 



TAT GTC 
Tyr Val 



ACC GAG 
Thr Glu 
460 



CAA CAG 
Gin Gin 



AAA 
Lvs 



AGT TAC 
Ser Tyr 
465 



TCC 
Ser 



AAA ACC 
Lys Thr 



TAT 
Tyr 



1507 



TAC TAC AGA AAG AAT ATT TGG GAC GTC ATT ATG AAA ATG TCA ATC GCA 
Tyr Tyr Arg Lys Asn lie Trp Asp Val lie Met Lys Met Ser He Ala 
470 475 480 485 



1555 



GAC TTA AAG AAG GAA ACG CTT GCT GAG GTC CAA GAA AAA GAG GTT GAA 
Asp Leu Lys Lys Glu Thr Leu Ala Glu Val Gin Glu Lys Glu Val Glu 
490 495 500 



1603 
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GAA TGG AAA AAG TCG CTT GGA TTT GCA CCT GGA AAA CTC AGA CTA ATA 1651 
Glu Trp Lys Lys Ser Leu Gly Phe Ala Pro Gly Lys Leu Arg Leu He 
505 510 515 

CCG AAG AAA ACT ACT TTC CGT CCA ATT ATG ACT TTC AAT AAG AAG ATT 1699 
Pro Lys Lys Thr Thr Phe Arg Pro He Met Thr Phe Asn Lys Lys He 
520 525 530 

GTA AAT TCA GAC CGG AAG ACT ACA AAA TTA ACT ACA AAT ACG AAG TTA 17 4 7 

Val Asn Ser Asp Arg Lys Thr Thr Lys Leu Thr Thr Asn Thr Lys Leu 
535 540 545 

TTG AAC TCT CAC TTA ATG CTT AAG ACA TTG AAG AAT AGA ATG TTT AAA 17 95 

Leu Asn Ser His Leu Met Leu Lys Thr Leu Lys Asn Arg Met Phe Lys 
550 555 560 565 

GAT CCT TTT GGA TTC GCT GTT TTT AAC TAT GAT GAT GTA ATG AAA AAG 184 3 

Asp Pro Phe Gly Phe Ala Val Phe Asn Tyr Asp Asp Val Met Lys Lys 
570 575 580 

TAT GAG GAG TTT GTT TGC AAA TGG AAG CAA GTT GGA CAA CCA AAA CTC 18 91 

Tyr Glu Glu Phe Val Cys Lys Trp Lys Gin Val Gly Gin Pro Lys Leu 
585 * 590 595 

TTC TTT GCA ACT ATG GAT ATC GAA AAG TGA TAT GAT AGT GTA AAC AGA 193 9 

Phe Phe Ala Thr Met Asp He Glu Lys Cys Tyr Asp Ser Val Asn Arg 
600 605 610 

GAA AAA CTA TCA ACA TTC CTA AAA ACT ACT AAA TTA CTT TCT TCA GAT 198 7 

Glu Lys Leu Ser Thr Phe Leu Lys Thr Thr Lys Leu Leu Ser Ser Asp 
615 620 625 

TTC TGG ATT ATG ACT GCA CPA ATT CTA AAG AGA AAG AAT AAC ATA GTT 203 5 

Phe Trp He Met Thr Ala Gin lie Leu Lys Arg Lys Asn Asn He Val 
630 * 635 640 645 

ATC GAT TCG AAA AAC TTT AGA AAG AAA GAA ATG AAA GAT TAT TTT AGA 208 3 

lie Asp Ser Lys Asn Phe Arg Lys Lys Glu Met Lys Asp Tyr Phe Arg 
650 , 655 660 

CAG AAA TTC CAG AAG ATT GCA CTT GAA GGA GGA CAA TAT CCA ACC TTA 2131 
Gin Lys Phe Gin Lys He Ala Leu Glu Gly Gly Gin Tyr Pro Thr Leu 
665 " 670 675 

TTC AGT GTT CTT GAA AAT GAA CAA AAT GAC TTA AAT GCA AAG AAA ACA 217 9 

Phe Ser Val Leu Glu Asn Glu Gin Asn Asp Leu Asn Ala Lys Lys Thr 
680 685 690 

TTA ATT GTT GAA GCA AAG CAA AGA AAT TAT TTT AAG AAA GAT AAC TTA 2 22 7 

Leu He Val Glu Ala Lys Gin Arg Asn Tyr Phe Lys Lys Asp Asn Leu 
695 700 705 

CTT CAA CCA GTC ATT AAT ATT TGC CAA TAT AAT TAC ATT AAC TTT AAT 227 5 

Leu Gin Pro Val He Asn He Cys Gin Tyr Asn Tyr lie Asn Phe Asn 
710 715 720 725 

GGG AAG TTT TAT AAA CAA ACA AAA GGA ATT CCT CAA GGT CTT TGA GTT 2 32 3 

Gly Lys Phe Tyr Lys Gin Thr Lys Gly lie Pro Gin Gly Leu Cys Val 
730 735 740 
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TCA TCA ATT TTG TCA TCA TTT TAT TAT GCA ACA TTA GAG GAA AGC TCC 2371 
Ser Ser He Leu Ser Ser Phe Tyr Tyr Ala Thr Leu Glu Glu Ser Ser 
745 750 755 

TTA GGA TTC CTT AGA GAT GAA TCA ATG AAC CCT GAA AAT CCA AAT GTT 2419 
Leu Gly Phe Leu Arg Asp Glu Ser Met Asn Pro Glu Asn Pro Asn Val 
760 765 770 

AAT CTT CTA ATG AGA CTT ACA GAT GAC TAT CTT TTG ATT ACA ACT CAA 24 67 

Asn Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu He Thr Thr Gin 
775 780 785 

GAG AAT AAT GCA GTA TTG TTT ATT GAG AAA CTT ATA AAC GTA AGT CGT 2 515 

Glu Asn Asn Ala Val Leu Phe He Glu Lys Leu He Asn Val Ser Arg 
790 795 800 805 

GAA AAT GGA TTT AAA TTC AAT ATG AAG AAA CTA CAG ACT AGT TTT CCA 2563 
Glu Asn Gly Phe Lys Phe Asn Met Lys Lys Leu Gin Thr Ser Phe Pro 
810 815 820 

TTA AGT CCA AGC AAA TTT GCA AAA TAC GGA ATG GAT AGT GTT GAG GAG 2611 
Leu Ser Pro Ser Lys Phe Ala Lys Tyr Gly Met Asp Ser Val Glu Glu 
825 830 835 

CAA AAT ATT GTT CAA GAT TAC TGC GAT TGG ATT GGC ATC TCA ATT GAT 2 659 

Gin Asn He Val Gin Asp Tyr Cys Asp Trp lie Gly lie Ser lie Asp 
840 845 850 

ATG AAA ACT CTT GCT TTA ATG CCA AAT ATT AAC TTG AGA ATA GAA GGA 27 07 

Ket Lys Thr Leu Ala Leu Met Pro Asn He Asn Leu Arg lie Glu Gly 
855 860 865 

ATT CTG TGT ACA CTC AAT CTA AAC ATG CAA ACA AAG AAA GCA TCA ATG 27 55 

lie Leu Cys Thr Leu Asn Leu Asn Met Gin Thr Lys Lys Ala Ser Met: 
370 875 ■ 880 835 

TGG CTC AAG AAG AAA CTA AAG TCG TTT TTA ATG AAT AAC ATT ACC CAT 28 0 3 

Tco Leu Lys Lys Lvs Leu Lys Ser Phe Leu Met Asn Asn He Thr His 
890 * 395 900 

TAT TTT AGA AAG ACG ATT ACA ACC GAA GAC TTT GCG AAT AAA ACT CTC 28 51 

Tyr Phe Arg Lys Thr lie Thr Thr Glu Asp Phe Ala Asn Lys Thr Leu 
905 910 515 

AAC AAG TTA TTT ATA TCA GGC GGT TAC AAA TAC ATG CAA. TGA GCC AAA 28 99 

Asn Lys Leu Phe He Ser Gly Gly Tyr Lys Tyr Met Gin Cys Ala Lys 
920 925 930 

GAA TAC AAG GAC CAC TTT AAG AAG AAC TTA GCT ATG AGC AGT ATG ATC 2 94 7 

Giu Tyr Lys Asp His Phe Lys Lys Asn Leu Ala Met Ser Ser Met He 
935 940 945 

GAC TTA GAG GTA TCT AAA ATT ATA TAC TCT GTA ACC AGA GCA TTC TTT 2995 
Asp Leu Giu Val Ser Lys lie lie Tyr Ser Val Thr Arg Ala Phe Phe 
950 955 960 965 

AAA TAC CTT GTG TGC AAT ATT AAG GAT ACA ATT ^TTT GGA GAG GAG CAT 304 3 

Lys Tyr Leu Val Cys Asn lie Lys Asp Thr lie Phe Gly Glu Glu His 
970 975 980 
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TAT CCA GAC TTT TTC CTT AGC ACA CTG AAG CAC TTT ATT GAA ATA TTC 3091 
Tyr Pro Asp Phe Phe Leu Ser Thr Leu Lys His Phe lie Glu lie Phe 
985 990 995 

AGC ACA AAA AAG TAC ATT TTC AAC AGA GTT TGC ATG ATC CTC AAG GCA 3139 
Ser Thr Lys Lys Tyr lie Phe Asn Arg Val Cys Met lie Leu Lys Ala 
1000 1005 1010 

AAA GAA GCA AAG CTA AAA AGT GAC CAA TGT CAA TCT CTA ATT CAA TAT 3187 
Lys Glu Ala Lys Leu Lys Ser Asp Gin Cys Gin Ser Leu lie Gin Tyr 
1015 1020 1025 

GAT GCA TAGTCGACTA TTCTAACTTA TTTTGGAAAG TTAATTTTCA ATTTTTGTCT 32 4 3 

Asp Ala 
1030 

TATATACTGG GGTTTTGGGG TTTTGGGGTT TTGGGG 327 9 



(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1031 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear f 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

Met Glu Val Asp Val Asp Asn Gin Ala Asp Asn His Gly He His Ser 
1 k 5 10 15 

Aia Leu Lys Thr Cys Glu Glu He Lys Glu Ala Lys Thr Leu Tyr Ser 
20 25 30 

Trp He Gin Lys Val He Arg Cys Arg Asn Gin Ser Gin Ser His Tyr 
35 40 45 

Lys Asp Leu Glu Asp He Lys He Phe Ala Gin Thr Asn He Val Ala 
50 55 60 

Thr Pro Arg Asp Tyr Asn Glu Glu Asp Phe Lys Val He Ala Arg Lys 
65 70 75 80 

Glu Val Phe Ser Thr Gly Leu Met He Glu Leu He Asp Lys Cys Leu 
85 90 95 

Val Glu Leu Leu Ser Ser Ser Asp Val Ser Asp Arg Gin Lys Leu Gin 
100 105 110 

Cvs Phe Gly Phe Gin Leu Lys Gly Asn Gin Leu Ala Lys Thr His Leu 
115 120 125 

Leu Thr Ala Leu Ser Thr Gin Lys Gin Tyr Phe Phe Gin Asp Glu Trp 
130 135 140 



Asn Gin Val Arg Ala Met He Gly Asn Glu 
145 150 



Leu 
155 



Phe Arg His Leu Tyr 
160 
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Thr Lys Tyr Leu lie Phe Gin Arg Thr Ser Glu Gly Thr Leu Val Gin 
165 170 175 

Phe Cys Gly Asn Asn Val Phe Asp His Leu Lys Val Asn Asp Lys Phe 
180 * 185 190 

Asp Lys Lys Gin Lys Gly Gly Ala Ala Asp Met Asn Glu Pro Arg Cys 
195 200 205 

Cys Ser Thr Cys Lys Tyr Asn Val Lys Asn Glu Lys Asp His Phe Leu 
210 215 220 

Asn Asn lie Asn Val Pro Asn Trp Asn Asn Met Lys Ser Arg Thr Arg 
225 230 235 240 

lie Phe Tyr Cys Thr His Phe Asn Arg Asn Asn Gin Phe Phe Lys Lys 
245 250 255 

His Glu Phe Val Ser Asn Lys Asn Asn lie Ser Ala Met Asp Arg Ala 
260 265 270 

Gin Thr lie Phe Thr Asn lie Phe Arg Phe Asn Arg He Arg Lys Lys 
275 280 285 

Leu Lys Asp Lys Val He Glu Lys He Ala Tyr Met Leu Glu Lys Val 
290 * 295 300 

Lys Asp Phe Asn Phe Asn Tyr Tyr Leu Thr Lys Ser Cys Pro Leu Pro 
305 ^ 310 315 320 

Glu Asn Trp Arg Glu Arg Lys Gin Lys He Glu Asn Leu He Asn Lys 
325 330 335 

7'r.L- Arg Glu Glu Lys Ser Lys Tyr Tyr Glu Glu Leu Phe Ser Tyr Thr 
340 * ' 34 5 350 

Thr Asp Asn Lys Cys Val Thr Gin Phe He Asn Glu Phe Phe Tyr Asn 
355 ^ 360 ■ 365 

lie Leu Pro Lys Asp Phe Leu Thr Gly Arg Asn Arg Lys Asn Phe Gin 
370 375 380 

Lys Lys Val Lys Lys Tyr Val Giu Leu Asn Lys His Glu Leu lie His 
385 390 395 400 

Lys Asn Leu Leu Leu Glu Lys lie Asn Thr Arg Glu lie Ser Trp Met 
405 410 415 

Gin Val Glu Thr Ser Ala Lys His Phe Tyr Tyr Phe Asp His Glu Asn 
420 425 430 

lie Tyr Val Leu Trp Lys Leu Leu Arg Trp lie Phe Glu Asp Leu Val 
435 440 445 

Val Ser Leu lie Arg Cys Phe Phe Tyr Val Thr Glu Gin Gin Lys Ser 
450 455 460 



Tyr Ser Lys -Thr Tyr Tyr Tyr Arg Lys Asn lie Trp Asp Val lie Met 
465 470 475 480 
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Lys Met Ser He Ala Asp Leu Lys Lys Glu Thr Leu Ala Glu Val Gin 
485 490 495 

Glu Lys Glu Val Glu Glu Trp Lys Lys Ser Leu Gly Phe Ala Pro Gly 
500 505 510 

Lys Leu Arg Leu He Pro Lys Lys Thr Thr Phe Arg Pro He Met Thr 
515 520 525 

Phe Asn Lys Lys He Val Asn Ser Asp Arg Lys Thr Thr Lys Leu Thr 
530 535 540 

Thr Asn Thr Lys Leu Leu Asn Ser His Leu Met Leu Lys Thr Leu Lys 
545 550 555 560 

Asn Arg Met Phe Lys Asp Pro Phe Gly Phe Ala Val Phe Asn Tyr Asp 
565 570 575 

Asp Val Met Lys Lys Tyr Glu Glu Phe Val Cys Lys Trp Lys Gin Val 
580 585 590 

Gly Gin Pro Lys Leu Phe Phe Ala Thr Met Asp He Glu Lys Cys Tyr 
595 600 605 

Asp Ser Val Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys Thr Thr Lys 
610 615 620 

Leu Leu Ser Ser Asp Phe Trp He Met Thr Ala Gin He Leu Lys Arg 
625 630 635 640 

Lys Asn Asn He Val He Asp Ser Lys Asn Phe Arg Lys Lys Glu Met 
645 650 655 

Lys Asp Tyr Phe Arg Gin Lys Phe Gin Lys He Ala Leu Glu Gly Gly 
660 665 670 

Gin Tyr Pro Thr Leu Phe Ser Val Leu Glu Asn Glu Gin Asn Asp Leu 
675 680 635 

Asn Ala Lys Lys Thr Leu He Val Glu Ala Lys Gin Arg Asn Tyr Phe 
690 695 700 

Lys Lys Asp Asn Leu Leu Gin Pro Val lie Asn He Cys Gin Tyr Asn 
705 710 715 "■ . 720 

Tyr He Asn Phe Asn Gly Lys Phe Tyr Lys Gin Thr Lys Gly He Pro 
725 730 735 

Gin Gly Leu Cys Val Ser Ser He Leu Ser Ser Phe Tyr Tyr Ala Thr 
740 745 750 

Leu Glu Glu Ser Ser Leu Gly Phe Leu Arg Asp Glu Ser Met Asn Pro 
755 760 765 

Glu Asn Pro Asn Val Asn Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu 
770 775 780 



Leu He Thr Thr Gin Glu Asn Asn Ala Val Leu Phe He Glu Lys Leu 
785 790 795 800 
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lie Asn Val Ser Arg Glu Asn Gly Phe Lys Phe Asn Met Lys Lys Leu 
805 810 815 

Gin Thr Ser Phe Pro Leu Ser Pro Ser Lys Phe Ala Lys Tyr Gly Met 
820 825 830 

Asp Ser Val Glu Glu Gin Asn He Val Gin Asp Tyr Cys Asp Trp He 
835 840 845 

Gly He Ser He Asp Met Lys Thr Leu Ala Leu Met Pro Asn He Asn 
850 855 860 

Leu Arg He Glu Gly He Leu Cys Thr Leu Asn Leu Asn Met Gin Thr 
865 870 875 880 

Lys Lys Ala Ser Met Trp Leu Lys Lys Lys Leu Lys Ser Phe Leu Met 
885 890 895 

Asn Asn He Thr His Tyr Phe Arg Lys Thr He Thr Thr Glu Asp Phe 
900 905 910 

Ala Asn Lys Thr Leu Asn Lys Leu Phe He Ser Gly Gly Tyr Lys Tyr 
915 920 925 

Met Gin Cys Ala Lys Glu Tyr Lys Asp His Phe Lys Lys Asn Leu Ala 
930 * 935 940 

Met Ser Ser Met He Asp Leu Glu Val Ser Lys lie He Tyr Ser Val 
945 950 955 960 

Thr Arg Ala Phe Phe Lys Tyr Leu Val Cys Asn He Lys Asp Thr He 
965 970 975 

Phe Gly Glu Glu His Tyr Pro Asp Phe Phe Leu Ser Thr Leu Lys His 
930 ' 985 990 

Phe He Glu He Phe Ser Thr Lys Lys Tyr He Phe Asn Arg Val Cys 
995 1000 1005 

Met He Leu Lys Ala Lys Glu Ala Lys Leu Lys Ser Asp Gin Cys Gin 
1010 ^ 1015 1020 

Ser Leu He Gin Tyr Asp Ala 
1025 ■ 1030 



(2) INFORMATION FOR SEQ ID NO : 1 11 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5544 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) 



MOLECULE TYPE: DMA (genomic) 
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(ix) FEATURE: 

(A) NAME/KEY : CDS 

(B) LOCATION: j oin { 95 9 . . 12 1 6 , 1273.. 1353, 1425.. 1543, 

1595. .1857, 1894. .2286, 2326. .2396, 2436. .2705, 
2746. .2862, 2914. .3083, 3125. .3309, 3356. .3504, 
3546.-3759, 3797. .4046, 4086. .4252, 4296. .4392, 
4435. .4597) 

(D) OTHER INFORMATION: /note= "Schi zosaccharomyces pombe 

telomerase catalytic subunit (TRT) " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

GGTACCGATT TACTTTCCTT TCTTCATAAG CTAATTGCTT CCTCGAACGC TCCTAAATCT 60 

CTGGAAATAT TTTTACAAGA ACTCAATAAC AATACCAAGT CAAATTCCAA TATGAAGGTG 120 

TTATTAGTGA TCGATAATAT TTCTATTTTA TCGGTCGTTA CCAAGTATAA GGACAAAAAG 180 

AACAACTTCC TTCCCCCTAA AGACTTTTAC TTTATTAATT TACTTTTCAA ATATATTTCG 24 0 

GGTTCGCTTA CTTTTAATCG TGGTACTGTT TTAGCTGCTA CTTCTAGCCA ACCGCGTGTT 300 

TCTACCCCGT CATTGGATAT AGCTCTTGGA GTAGCTCACA GAAATCCTTA CAAATCTTCT 3 60 

GAT GAG ACT A TATTAGATTC ATTACAGTCC GTGCATATTC TTAACATGGA GCCTTACACT 4 20 

TTAGATGAGT CACGTCGCAT GATGGAGTAT TTGGTATCAT CCAACGTTTG CCTTGAAAAG 480 

GT TGATAATT ATTTGCAAAA TCATGTCCTT AGTGGTGGTA ATCCGCGAAA GTTTTTTGAT 5 40 

GCTTGCACAC GTCTAGCATG ATTGAGATAT TCAAAAATTT CTATCCACTA CAACTCCTTT 600 

AACGCGGTTT TATTTTTCTA TTTTCTATTC TCATGTTGTT CCAAATATGT ATCATCTCGT 6 60 

AT TAGGCTTT TTTCCGTTTT ACTCCTGGAA TCGTACCTTT TTCACTATTC CCCCTAATGA 7 20 

AT AAT CT AAA TTAGTTTCGC TTATAATTGA TAG TAG TAG A. AAGATTGGTG ATTCTACTCG 7 80 

TGTAATGTTA TTAGTTTAAA GATACTTTGC AAAACATTTA TTAGCTATCA TTATATAAAA 840 

AAAATCCTAT AATTATAAAT ATTAATCAAT ATTTGCGGTC ACTATTTATT TAAAACGTTA 900 

TGATCAGTAG GACACTTTGC AT AT AT AT AG TTATGCTTAA TGGTTACTTG TAACTTGC 9 58 

ATG ACC GAA CAC CAT ACC CCC AAA AGC AGG ATT CTT CGC TTT CTA GAG 100 6 
Met Thr Glu His His Thr Pro Lys Ser Arg lie Leu Arg Phe Leu Glu 
1 5 10 15 

AAT CAA TAT GTA TAC CTA TGT ACC TTA AAT GAT TAT GTA CAA CTT GTT 10 54 
Asn Gin Tyr Val Tyr Leu Cys Thr Leu Asn Asp Tyr Val Gin Leu Val 
20 25 30 

TTG AG A GGG TCG CCG GCA AGC TCG TAT AGC AAT ATA TGC GAA CGC TTG 1102 
Leu Arg Gly Ser Pro Ala Ser Ser Tyr Ser Asn lie Cys Glu Arg Leu 
35 40 45 

AG A AGC GAT GTA CAA ACG TCC TTT TCT ATT TTT CTT CAT TCG ACT GTA 1150 
Arg Ser Asp Val Gin Thr Ser Phe Ser lie Phe Leu His Ser Thr Val 
50 55 60 
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GTC GGC TTC GAC AGT AAG CCA GAT GAA GGT GTT CAA TTT TCT TCT CCA 1198 
Val Gly Phe Asp Ser Lys Pro Asp Glu Gly Val Gin Phe Ser Ser Pro 
65 70 75 80 

AAA TGC TCA CAG TCA GAG GTATATATAT TTTTGTTTTG ATTTTTTTCT 124 6 

Lys Cys Ser Gin Ser Glu 
85 

ATTCGGGATA GCTAATATAT GGGCAG CTA ATA GCG AAT GTT GTA AAA CAG ATG 12 99 

Leu He Ala Asn Val Val Lys Gin Met 
90 95 

TTC GAT GAA AGT TTT GAG CGT CGA AGG AAT CTA CTG ATG AAA GGG TTT 134 7 

Phe Asp Glu Ser Phe Glu Arg Arg Arg Asn Leu Leu Met Lys Gly Phe 
100 105 110 

TCC ATG GTAAGGTATT CTAATTGTGA AATATTTACC TGCAATTACT GTTTCAAAGA 14 03 

Ser Met 



GATTGTATTT AACCGATAAA G AAT CAT GAA GAT TTT CGA GCC ATG CAT GTA 14 54 

Asn His Glu Asp Phe Arg Ala Met His Val 
115 120 

AAC GGA GTA CAA AAT GAT CTC GTT TCT ACT TTT CCT AAT TAC CTT ATA 1502 

Asn Gly Val Gin Asn Asp Leu Val Ser Thr Phe Pro Asn Tyr Leu He 

125 130 135 

TCT ATA CTT GAG TCA AAA AAT TGG CAA CTT TTG TTA GAA AT 154 3 

Ser He Leu Glu Ser Lys Asn Trp Gin Leu Leu Leu Glu He 
140 145 150 

G7AAATACCG GTTAA.GATGT TGCGCACTTT GAACAAGACT GACAAGTATA G T ATC 15 98 

He 



GGC AGT GAT GCC ATG CAT TAC TTA TTA TCC AAA GGA AGT ATT TTT GAG 164 6 

Gly Ser Asp Ala Met His Tyr Leu Leu Ser Lys Gly Ser He Phe Glu 

155 160 165 170 

GCT CTT CCA AAT GAC AAT TAC CTT CAG ATT TCT GGC ATA CCA CTT TTT 169 4 

Ala Leu Pro Asn Asp Asn Tyr Leu Gin He Ser Gly He Pro Leu Phe 

175 180 185 

AAA AAT AAT GTG TTT GAG GAA ACT GTG TCA AAA AAA AGA AAG CGA ACC 17 4 2 

Lys Asn Asn Val Phe Glu Glu Thr Val Ser Lys Lys Arg Lys Arg Thr 

190 195 2-00 ' 

ATT GAA ACA TCC ATT ACT CAA AAT AAA AGC GCC CGC AAA GAA GTT TCC 17 90 

He Glu Thr Ser He Thr Gin Asn Lys Ser Ala Arg Lys Glu Val Ser 

205 210 215 

TGG AAT AGC ATT TCA ATT AGT AGG TTT AGC ATT TTT TAC AGG TCA TCC 18 38 

Trp Asn Ser lie Ser He Ser Arg Phe Ser He Phe Tyr Arg Ser Ser 
220 225 230 

TAT AAG AAG TTT AAG CAA G GTAACTAATA CTGTTATCCT TCATAACTAA 18 8 7 

Tyr Lys Lys Phe Lys Gin 
235 ' 240 



( 
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TTTTAG AT CTA TAT TTT AAC TTA CAC TCT ATT TGT GAT CGG AAC ACA 1934 
Asp Leu Tyr Phe Asn Leu His Ser lie Cys Asp Arg Asn Thr 
245 250 

GTA CAC ATG TGG CTT CAA TGG ATT TTT CCA AGG CAA TTT GGA CTT ATA 1982 
Val His Met Trp Leu Gin Trp lie Phe Pro Arg Gin Phe Gly Leu lie 
255 260 265 270 

AAC GCA TTT CAA GTG AAG CAA TTG CAC AAA GTG ATT CCA CTG GTA TCA 2030 
Asn Ala Phe Gin Val Lys Gin Leu His Lys Val lie Pro Leu Val Ser 
275 280 285 

CAG AGT ACA GTT GTG CCC AAA CGT CTC CTA AAG GTA TAC CCT TTA ATT 2078 
Gin Ser Thr Val Val Pro Lys Arg Leu Leu Lys Val Tyr Pro Leu lie 
290 295 300 

GAA CAA ACA GCA AAG CGA CTC CAT CGT ATT TCT CTA TCA AAA GTT TAC 2126 
Glu Gin Thr Ala Lys Arg Leu His Arg lie Ser Leu Ser Lys Val Tyr 
305 310 315 

AAC CAT TAT TGC CCA TAT ATT GAC ACC CAC GAT GAT GAA AAA ATC CTT 217 4 

Asn His Tyr Cys Pro Tyr lie Asp Thr His Asp Asp Glu Lys lie Leu 
320 325 330 

AGT TAT TCC TTA AAG CCG AAC CAG GTG TTT GCG TTT CTT CGA TCC ATT 2222 
Ser Tyr Ser Leu Lys Pro Asn Gin Val Phe Ala Phe Leu Arg Ser lie 
335 340 345 ' 350 

CTT GTT CGA GTG TTT CCT AAA TTA ATC TGG GGT AAC CAA AGG ATA TTT 2270 
Leu Val Arg Val Phe Pro Lys Leu lie Trp Gly Asn Gin Arg lie Phe 
355 360 365 

GAG ATA ATA TTA AAA G GTATTGTATA AAATTTATTA CCACTAACGA TTTTACCAG AC 

2 3 2 7 

Glu He lie Leu Lys Asp 
370 

CTC GAA ACT TTC TTG AAA TTA TCG AG A TAC GAG TCT TTT AGT TTA CAT 2 37 5 

Leu Glu Thr Phe Leu Lys Leu Ser Arg Tyr Glu Ser Phe Ser Leu His 
375 380 385 

TAT TTA ATG AGT AAC ATA AAG GTAATATGCC AAATTTTTTT ACCATTAATT 24 2 6 

Tyr Leu Met Ser Asn He Lys 
390 395 

AACAATCAG ATT TCA GAA ATT GAA TGG CTA GTC CTT GGA AAA AGG TCA 24 74 

He Ser Glu He Glu Trp Leu Val Leu Gly Lys Arg Ser 
400 405 

AAT GCG AAA ATG TGC TTA AGT GAT TTT GAG AAA CGC AAG CAA ATA TTT 2 52 2 

Asn Ala Lys Met Cys Leu Ser Asp Phe Glu Lys Arg Lys Gin He Phe 
410 415 420 

GCG GAA TTC ATC TAC TGG CTA TAC AAT TCG TTT ATA ATA CCT ATT TTA 2 570 

Ala Glu Phe lie Tyr Trp Leu Tyr Asn Ser Phe He He Pro He Leu 
425 430 435 440 



CAA T'CT 
Gin Ser 



TTT TTT TAT ATC ACT GAA TCA AGT GAT TTA CGA AAT CGA ACT 
Phe Phe Tyr He Thr Glu Ser Ser Asp Leu Arg Asn Arg Thr 
445 450 455 



2618 
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GTT TAT TTT AGA AAA GAT ATT TGG AAA CTC TTG TGC CGA CCC TTT ATT 2 6 66 

Val Tyr Phe Arg Lys Asp lie Trp Lys Leu Leu Cys Arg Pro Phe lie 
460 465 470 

ACA TCA ATG AAA ATG GAA GCG TTT GAA AAA ATA AAC GAG GTATTTTAAA 2715 
Thr Ser Met Lys Met Glu Ala Phe Glu Lys lie Asn Glu 
475 480 485 

GTATTTTTTG CAAAAAGCTA ATATTTTCAG AAC AAT GTT AGG ATG GAT ACT CAG 27 69 

Asn Asn Val Arg Met Asp Thr Gin 
490 

AAA ACT ACT TTG CCT CCA GCA GTT ATT CGT CTA TTA CCT AAG AAG AAT 2817 
Lys Thr Thr Leu Pro Pro Ala Val lie Arg Leu Leu Pro Lys Lys Asn 
495 500 505 

ACC TTT CGT CTC ATT ACG AAT TTA AGA AAA AGA TTC TTA ATA AAG 2862 
Thr Phe Arg Leu lie Thr Asn Leu Arg Lys Arg Phe Leu lie Lys 
510 515 520 

GTATTAATTT TTGGTCATCA ATGTACTTTA CTTCTAATCT ATTATTAGCA G ATG GGT 2919 

Met Gly 
525 

TCA AAC AAA AAA ATG TTA GTC AGT ACG AAC CAA ACT TTA CGA CCT GTG 2 967 

Ser Asn Lys Lys Met Leu Val Ser Thr Asn Gin Thr Leu Arg Pro Val 
530 535 540 

GCA TCG ATA CTG AAA CAT TTA ATC AAT GAA GAA AGT AGT GGT ATT CCA 3015 
Ala Ser lie Leu Lys His Leu He Asn Glu Glu Ser Ser Gly He Pro 
545 550 555 

TTT AAC TTG GAG GTT TAC ATG AAG CTT CTT ACT TTT AAG AAG GAT CTT 3063 
Phe Asn Leu Glu Val Tyr Met Lys Leu Leu Thr Phe Lys Lys Asp Leu 
560 565 570 

CTT AAG CAC CGA ATG TTT GG GTAATTATAT AATGCGCGAT TCCTCATTAT 3113 
Leu Lys [{is A r g M e t Phe Gly 
575 " 580 

TAATTTTGCA G G CGT AAG AAG TAT TTT GTA CGG ATA GAT ATA AAA TCC 3161 
Arg Lys Lys Tyr Phe Val Arg He Asp He Lys Ser 
585 590 

TGT TAT GAT CGA ATA AAG CAA GAT TTG ATG TTT CGG ATT GTT AAA AAG 3209 
Cys Tyr Asp Arg He Lys Gin Asp Leu Met Phe Arg He Val Lys Lys 
595 600 605 

AAA CTC AAG GAT CCC GAA TTT GTA ATT CGA AAG TAT GCA ACC ATA CAT 3 2 57 

Lys Leu Lys Asp Pro Glu Phe Val He Arg Lys Tyr Ala Thr lie His 
610 615 620 625 

GCA ACA AGT GAC CGA GCT ACA AAA AAC TTT GTT AGT GAG GCG TTT TCC 330 5 

Ala Thr Ser Asp Arg Ala Thr Lys Asn Phe Val Ser Glu Ala Phe Ser 
630 635 640 

TAT T GTAAGTTTAT TTTTTCATTG GAATTTTTTA ACAAA.TTCTT TTTTAG TT 3357 
Tyr Phe 
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GAT ATG GTG CCT TTT GAA AAA GTC GTG CAG TTA CTT TCT ATG AAA ACA 34 05 

Asp Met Val Pro Phe Glu Lys Val Val Gin Leu Leu Ser Met Lys Thr 
645 650 655 

TCA GAT ACT TTG TTT GTT GAT TTT GTG GAT TAT TGG ACC AAA AGT TCT 34 53 

Ser Asp Thr Leu Phe Val Asp Phe Val Asp Tyr Trp Thr Lys Ser Ser 

660 665 670 675 

TCT GAA ATT TTT AAA ATG CTC AAG GAA CAT CTC TCT GGA CAC ATT GTT 3501 

Ser Glu He Phe Lys Met Leu Lys Glu His Leu Ser Gly His He Val 
680 685 690 

AAG GTATACCAAT TGTTGAATTG TAATAACACT AATGAAACTA G ATA GGA AAT 3554 
Lys He Gly Asn 

695 

TCT CAA TAC CTT CAA AAA GTT GGT ATC CCT CAG GGC TCA ATT CTG TCA 3602 

Ser Gin Tyr Leu Gin Lys Val Gly He Pro Gin Gly Ser He Leu Ser 
700 705 710 

TCT TTT TTG TGT CAT TTC TAT ATG GAA GAT TTG ATT GAT GAA TAC CTA 3650 

Ser Phe Leu Cys His Phe Tyr Met Glu Asp Leu lie Asp Glu Tyr Leu 
715 720 725 

TCG TTT ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA GTA GTC GAC GAT 3698 

Ser Phe Thr Lys Lys Lys Gly Ser Val Leu Leu Arg Val Val Asp Asp 
730 735 740 

TTC CTC TTT ATA ACA GTT AAT AAA AAG GAT GCA AAA AAA TTT TTG AAT 37 4 6 

Phe Leu Phe He Thr Val Asn Lys Lys Asp Ala Lys Lys Phe Leu Asn 
745 750 755 

TTA TCT TTA AG A G GTGAGTTGCT GTCATTCCTA AGTTCTAACC G TTG AAG GA 37 98 

Leu Ser Leu Arg Gly 

760 

TTT GAG AAA CAC AAT TTT TCT ACG AGC CTG GAG AAA ACA GTA ATA AAC 38 4 6 

Phe Glu Lys His Asn Phe Ser Thr Ser Leu Glu. Lys Thr Val He Asn 

765 " ■ 770 775 780 

TTT GAA AAT AGT AAT GGG ATA ATA .AAC AAT ACT TTT TTT AAT GAA AGC 3894 

Phe Glu Asn Ser Asn Gly He He Asn Asn Thr Phe Phe Asn Glu Ser 
785 790 795 

AAG AAA AG A ATG CCA TTC TTC GGT TTC TCT GTG AAC ATG AGG TCT CTT 3 94 2 

Lys Lys Arg Met Pro Phe Phe. Gly Phe Ser Val Asn Met Arg Ser Leu 
800 805 810 

GAT ACA TTG TTA GCA TGT CCT AAA ATT GAT GAA GCC TTA TTT AAC TCT 3990 

Asp Thr Leu Leu Ala Cys Pro Lys He Asp Glu Ala Leu Phe Asn Ser 
815 820 825 

ACA TCT GTA GAG CTG ACG AAA CAT ATG GGG AAA TCT TTT TTT TAC AAA 4 0 38 

Thr Ser Val Glu Leu Thr Lys His Met Gly Lys Ser Phe Phe Tyr Lys 
830 835 840 

ATT CTA AG GTATACTGTG TAACTGAATA ATAGCTGACA AAT AAT CAG A TCG 4 08 9 

lie Leu Arg Ser 

845 
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AGC CTT GCA TCC TTT GCA CAA GTA TTT ATT GAC ATT ACC CAC AAT TCA 4137 
Ser Leu Ala Ser Phe Ala Gin Val Phe He Asp He Thr His Asn Ser 
850 855 860 

AAA TTC AAT TCT TGC TGC AAT ATA TAT AGG CTA GGA TAC TCT ATG TGT 4185 
Lys Phe Asn Ser Cys Cys Asn He Tyr Arg Leu Gly Tyr Ser Met Cys 
865 870 875 880 

ATG AGA GCA CAA GCA TAC TTA AAA AGG ATG AAG GAT ATA TTT ATT CCC 4 233 

Met Arg Ala Gin Ala Tyr Leu Lys Arg Met Lys Asp He Phe He Pro 
885 890 895 

CAA AGA ATG TTC ATA ACG G GTGAGTACTT ATTTTAACTA GAAAAGTCAT 4 282 

Gin Arg Met Phe He Thr 
900 

TAATTAACCT TAG AT. CTT TTG AAT GTT ATT GGA AGA AAA ATT TGG AAA 4 330 

Asp Leu Leu Asn Val He Gly Arg Lys He Trp Lys 
905 910 

AAG TTG GCC GAA ATA TTA GGA TAT ACG AGT AGG CGT TTC TTG TCC TCT 4 37 8 

Lys Leu Ala Glu He Leu Gly Tyr Thr Ser Arg Arg Phe Leu Ser Ser 
915 920 925 930 

GCA GAA GTC AAA TG GTACGTGTCG GTCTCGAGAC TTCAGCAATA TTGACACATC 4 4 32 

Ala Glu Val Lys Trp 
935 

AG G CTT TTT TGT CTT GGA ATG AGA GAT GGT TTG AAA CCC TCT TTC AAA 4 4 80 

Leu Phe Cvs Leu Gly Met Arg Asp Gly Leu Lys Pro Ser Phe Lys 
940 945 950 

TAT CAT CCA TGC TTC GAA CAG CTA ATA TAC CAA TTT CAG TCA TTG ACT 4 528 

Tvr His Pro Cys Phe Glu Gin Leu He Tyr Gin Phe Gin Ser Leu Thr 
955 960 965 

GAT CTT ATC AAG CCG CTA AGA CCA GTT TTG CGA CAG GTG TTA TTT TTA . 4 57 6 

Asp Leu He Lys Pro Leu Arg Pro Val Leu Arg Gin Val Leu Phe Leu 
970 975 930 

CAT AGA AGA ATA GCT GAT TAATGTCATT TTCAATTTAT TAT AT AC ATC 4 624 
His Arg Arg He Ala Asp 
935 

CTTTATTACT GGTGTCTTAA ACAATATTAT TACTAAGTAT AGCTGACCCC CAAAGCAAGC 4 68 4 

AT ACT AT AGG ATTTCTAGTA AAGTAAAATT AATCTCGTTA TTAGTTTTGA TTGACTTGTC 4 74 4 

TTTATCCTTA TACTTTTAAG AAAGATTGAC AGTGGTTGCT GACTACTGCC CACATGCCCA 4 804 

TTAAACGGGA GTGGTTAAAC ATTAAAAGTA AT AC ATG AGG CTAATCTCCT TTCATTTAGA 4 8 64 

AT AAG G AAA G TGGTTTTCTA TAATGAATAA TGCCCGCACT AAT G C AAAAA GACGAAGATT 4 92 4 

ATCTTCTAAA CAAGGGGGAT TAAGCATATC CGAAGGAAAA GAGAGTAATA TACCCAGTGT 4 98 4 

TGTTGAAGAA AGCAAGGATA ATTTGGAACA AGCTTCTGCA GATGACAGGC TAAATTTTGG 504 4 

TGACCGAATT TTGGTAAAAG CCCCAGGTTA TCCATGGTGG CCGGCCTTGC TACTGAGACG 5104 

AAAAGAAACT AAGGATAGTT TGAATACTAA TAGCTCATTT AATGTCTTAT ATAAGGTTTT 5164 



GTTTTTTCCT GACTTCAATT TTGCATGGGT GAAAAGAAAT AGTGTTAAGC CATTATTGGA 5224 



TTCCGAAATA GCCAAATTTC TTGGTTCCTC AAAGCGGAAG TCTAAAGAAC TTATTGAAGC 528 4 

TTATGAGGCT TCAAAAACTC CTCCTGATTT AAAGGAGGAA TCTTCCACCG ATGAGGAAAT 5 34 4 

GGATAGCTTA TCAGCTGCTG AGGAGAAGCC TAATTTTTTG CAAAAAAGAA AATATCATTG 54 04 

GGAGACATCT CTTGATGAAT CAGATGCGGA GAGTATCTCC AGCGGATCCT TGATGTCAAT 54 64 

AACTTCTATT TCTGAAATGT ATGGTCCTAC TGTCGCTTCG ACTTCTCGTA GCTCTACGCA 5524 

GTTAAGTGAC CAAAGGTACC 554 4 



(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 988 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

Men Thr Glu His His Thr Pro Lys Ser Arg He Leu Arg.Phe Leu Glu 
15 10 15 

Asn Gin Tyr Val Tyr Leu Cys Thr Leu Asn Asp Tyr Val Gin Leu Val 
20 25 30 

Leu Arg Gly Ser Pro Ala Ser Ser Tyr Ser Asn He Cys Glu Arg Leu 
35 40 * 45 

Arg Ser Asp Val Gin Thr Ser Phe Ser He Phe Leu His Ser Thr Val 
50 55 60 

Val Gly Phe Asp Ser Lys Pro Asp Glu Gly Val Gin Phe Ser Ser Pro 
65 * 70 75 80 

Lys Cys Ser Gin Ser Glu Leu He Ala Asn Val Val Lys Gin Met Phe 
85 90 95 

Asp Glu Ser Phe Glu Arg Arg Arg Asn Leu Leu Met Lys Gly Phe Ser 
100 105 110 

Met Asn His Glu Asp Phe Arg Ala Met His Val Asn Gly Val Gin Asn 
115 120 125 

Asp Leu Val Ser Thr Phe Pro Asn Tyr Leu lie Ser He Leu Glu Ser 
130 135 140 

Lys Asn Trp Gin Leu Leu Leu Glu He He Gly Ser Asp Ala Met His 
145 150 155 160 

Tyr Leu Leu Ser Lys Gly Ser He Phe Glu Ala Leu Pro Asn Asp Asn 
165 170 175 



Tyr Leu Gin lie Ser Gly He Pro Leu Phe Lys Asn Asn Val Phe Glu 
180 135 190 
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Glu Thr Val Ser Lys Lys Arg Lys Arg Thr He Glu Thr Ser He Thr 
195 200 205 

Gin Asn Lys Ser Ala Arg Lys Glu Val Ser Trp Asn Ser He Ser He 
210 215 220 

Ser Arg Phe Ser He Phe Tyr Arg Ser Ser Tyr Lys Lys Phe Lys Gin 
225 230 235 240 

Asp Leu Tyr Phe Asn Leu His Ser He Cys Asp Arg Asn Thr Val His 
245 250 255 

Met Trp Leu Gin Trp He Phe Pro Arg Gin Phe Gly Leu He Asn Ala 
260 265 270 

Phe Gin Val Lys Gin Leu His Lys Val He Pro Leu Val Ser Gin Ser 
275 . 280 285 

Thr Val Val Pro Lys Arg Leu Leu Lys Val Tyr Pro Leu He Glu Gin 
290 295 300 

Thr Ala Lys Arg Leu His Arg He Ser Leu Ser Lys Val Tyr Asn His 
305 310 315 320 

Tyr Cys Pro Tyr He Asp Thr His Asp Asp Glu Lys He Leu Ser Tyr 
325 330 335 

Ser Leu Lys Pro Asn Gin Val Phe Ala Phe Leu Arg Ser He Leu Val 
340 345 350 

Arc Val Phe Pro Lys Leu He Trp Gly Asn Gin Arg He Phe Glu He 
355 360 365 

lie Leu Lys Aso Leu Glu Thr Phe Leu Lys Leu Ser Arg Tyr Glu Ser 
370 ^ 375 380 

Phe Ser Leu His Tyr Leu Met Ser Asn He Lys He Ser Glu He Glu 
335 390 395 400 

Tro Leu Val Leu Gly Lys Arg Ser Asn Ala Lys Met Cys Leu Ser Asp 
405 410 415 

Phe Glu Lys Arg Lys Gin He Phe Ala Glu Phe He Tyr Trp Leu Tyr 
420 425 430 

Asn Ser Phe He He Pro He Leu Gin Ser Phe Phe Tyr lie Thr Glu 
435 440 445 

Ser Ser Asp Leu Arg Asn Arg Thr Val Tyr Phe Arg Lys Asp He Trp 
450 * 455- - 460 

Lvs Leu Leu Cys Arg Pro Phe He Thr Ser Met Lys Met Glu Ala Phe 
455 470 475 480 



Glu Lys lie Asn Glu Asn Asn Val Arg Met Asp Thr Gin Lys Thr Thr 
485 490 495 



Leu Pro Pro Ala Val lie Arg Leu Leu Pro Lys Lys Asn Thr Phe Arg 
500 505 510 
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Leu lie Thr Asn 
515 

Lys Lys Met Leu 
530 

lie Leu Lys His 
545 

Leu Glu Val Tyr 



His Arg Met Phe 
580 

Ser Cys Tyr Asp 
595 

Lys Lys Leu Lys 
610 

His Ala Thr Ser 

625 

Ser Tyr Phe Asp 



Met Lys Thr Ser 
660 

Lys Ser Ser Ser 
675 

His lie Val Lys 
590 

Pro Gin Gly Ser 

7 0 5 

Asp Leu I le Asp 



Leu Leu Arg Val 
740 

Asp Ala Lys Lys 
755 

Asn Phe Ser Thr 
770 

Asn Gly He He 

7S5 

Pro Phe Phe Gly 



Ala Cys Pro Lys 
820 



Leu Arg Lys Arg 
520 

Val Ser Thr Asn 
535 

Leu He Asn Glu 
550 

Met Lys Leu Leu 
565 

Gly Arg Lys Lys 



Arg He Lys Gin 
600 

Asp Pro Glu Phe 
615 

Asp Arg Ala Thr 
630 

Met Val Pro Phe 
645 

Asp Thr Leu Phe 



Glu lie Phe Lys 

680 

He Gly Asn Ser 
695 

He Leu Ser Ser 
710 

Glu Tyr Leu Ser 
725 

Val Asp Asp Phe 



Phe Leu Asn Leu 
760 

Ser Leu Glu Lys 
775 

Asn Asn Thr Phe 
790 

Phe Ser Val Asn 
805 

He Asp Glu Ala 



Phe Leu He Lys 



Gin Thr Leu Arg 
540 

Glu Ser Ser Gly 
555 

Thr Phe Lys Lys 
570 

Tyr Phe Val Arg 
585 

Asp Leu Met Phe 



Val lie Arg Lys 
620 

Lys Asn Phe Val 
635 

Glu Lys Val Val 
650 

Val Asp Phe Val 
665 

Met Leu Lys Glu 



Gin Tyr Leu Gin 
700 

Phe Leu Cys His 
715 

Phe Thr Lys Lys 
730 

Leu Phe lie Thr 
745 

Ser Leu Arg Gly 



Thr Val He Asn 
780 

Phe Asn Glu Ser 
795 

Met Arg Ser Leu 
810 

Leu Phe Asn Ser 
825 



Met Gly Ser Asn 
525 

Pro Val Ala Ser 



He Pro Phe Asn 
560 

Asp Leu Leu Lys 
575 

He Asp He Lys 
590 

Arg He Val Lys 
605 

Tyr Ala Thr lie 



Ser Glu Ala Phe 
640 

Gin Leu Leu Ser 
655 

Asp Tyr ~Trp Thr 
670 

His Leu Ser Gly 
685 

Lys Val Gly He 



Phe Tyr Met Glu 
720 

Lys Gly Ser Val 
735 

Val Asn Lys Lys 
750 

Phe Glu Lys His 
765 

Phe Glu Asn Ser 



Lys Lys Arg Met 
800 

Asp Thr Leu Leu 
815 

Thr Ser Val Glu 
830 
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Leu Thr Lys His Met Gly Lys Ser Phe Phe Tyr Lys lie Leu Arg Ser 
835 840 845 

Ser Leu Ala Ser Phe Ala Gin Vai Phe He Asp He Thr His Asn Ser 
850 855 860 

Lys Phe Asn Ser Cys Cys Asn He Tyr Arg Leu Gly Tyr Ser Met Cys 
865 870 875 880 

Met Arg Ala Gin Ala Tyr Leu Lys Arg Met Lys Asp He Phe He Pro 
885 890 ' 895 

Gin Arg Met Phe lie Thr Asp Leu Leu Asn Vai He Gly Arg Lys He 
900 905 910 

Trp Lys Lys Leu Ala Glu lie Leu Gly Tyr Thr Ser Arg Arg Phe Leu 
915 . 920 925 

Ser Ser Ala Glu Vai Lys Trp Leu Phe Cys Leu Gly Met Arg Asp Gly 
930 935 940 

Leu Lys Pro Ser Phe Lys Tyr His Pro Cys Phe Glu Gin Leu lie Tyr 
945 950 955 960 

Gin Phe Gin Ser Leu Thr Asp Leu He Lys Pro Leu Arg Pro Vai Leu 
965 970 975 

Arg Gin Vai Leu Phe Leu His Arg Arg lie Ala Asp 
980 985 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 base pairs 
{3) TYPE: nucleic acid 

(C) STRANDEDMESS: single 

(D) TOPOLOGY:* linear 

(ii) MOLECULE TYPE: DMA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 
CAAAACCCCA AAACC 15 



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS : single 
{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 114: 



GGGGTTTTGG GGTTTT 



16 
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(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..1762 

(D) OTHER INFORMATION: /note* "gene encoding the 4 3 kDa 

telomerase protein subunit from 
Euplotes aediculatus" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG GTAGTTTAGA 60 

AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA TTTGGATGAT ATAGAAAATT 120 

TACTTCCTAA TACATTCAAC AAGTATAGCA GCTCTTGTAG TGACAAGAAA GGATGCAAAA 180 

CATTGAAATC TGGCTCGAAA TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG 24 0 

AGTTCTACTT CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA 300 

AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA AATCAGGTAA 360 

TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA TTATGGAGAA AATTACTTAA 420 

TACTAAAAGG TAAACAGTTT GGATTATTTC CCTAGCCAAC AATGATGAGT ATATTAAATT 4 80 

CA7ATGAGAA TGAGTCAAAG GATCTCGATA CATCAGACTT ACCAAAGACA AACTCGCTAT 54 0 

AAAACGCAAG AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 600 

TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC TTGAGACAAT 660 

TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA AGTTCTGATG TGTATGCCAT 7 20 

TATTTTGTGA ATTAATCTCA AATATCTTAT CTCAATTTAA TGGATAGCTA TAGAAACAAA 780 

CCAAATAAAC CATGCAAGTT TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC 84 0 

TGAATTTATA TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC 900 

TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT AAAAGAAGCA 960 

GGCGAAATGA AAAGAAGACT AAAG AAAG AG ATTTCAAAAT TTGTTGATTC TTCTGTAACC 102 0 

GGAATTAACA ACAAGAATAT TAGCAACGAA AAAGAAGAAG AGCTATCACA ATCCTGATTC 1080 

TTAAAGATTT CAAAAATTCC AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTATAG 114 0 

TTTTTCATTT CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 1200 
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GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT TCACATTCAT 12 60 

AGATCGACCT TCATATATCC AATACGATGA TAAGGAAACA GCAGTCATCC GTTTTAAAAA 1320 

TAGTGCTATG AGGACTAAAT TTTTAGAGTC AAGAAATGGA GCCGAAATCT TAATCAAAAA 138 0 

GAATTGCGTC GATATTGCAA AAGAATCGAA CTCTAAATCT TTCGTTAATA AGTATTACCA 14 4 0 

ATCTTGATTG ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGATCATTA AAGAAATAAA 1500 

GTAACTTTTA TTAATTAGAG AATAAACTAA ATTACTAATA TAGAGATCAG CGATCTTCAA 1560 

TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA ATACAAACCT TGGTCAAAAT 1620 

ATTGAGGAAG GAAAAGAAGA CCAGTTAGCA AAAGAAAAAA TAAGGCAATA AATAAAATGA 1680 

GTACAGAAGT GAAGAAATAA AAGATTTATT TTTTTCAATA ATTTATTGAA AAGAGGGGTT 174 0 

TTGGGGTTTT GGGGTTTTGG GG 1762 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS: 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

Pro Gin Asn Pro Lys Thr Pro Lys Pro Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID MO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

Lys Lys Lys Lys Leu Arg 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

Phe Arg Asn Lys lie Leu Phe Pro His Lys Trp Arg Trp lie Leu lie 
15 10 15 

Trp Met lie 



(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

Lys 1 1-2 Tyr Phe Leu He His Ser Thr Ser He Ala Ala Leu Val Val 
15 10 15 

Thr Arc Lys Asp Ala Lys His Cys Asn Leu Ala Arg Asn Arg Leu His 

2Q 25 ^ 30 

Cys Leu Phe Gin Ser Cys Lys Asn Asn 
35 40 



(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 120: 

Ser Ser Thr Ser Arg Met Gin He Phe He Thr lie Leu Ser Cys Glu 
15 10 15 



Asn 
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(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE:* amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121 

Lys Ala Glu Ser Lys Glu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

{ D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 

Lvs Leu Lys His Tyr 
T 5 



{2} INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 123 

Cys Leu Asn Lys lie Arg 
1 5 



(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xij SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

Cys Gly Leu Phe Tyr Phe Leu Asp His Phe Leu Arg Ser lie Met Glu 
15 10 15 

Lys lie Thr 



(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:125: 

Lys Val Asn Ser Leu Asp Tyr Phe Pro Ser Gin Gin Cys Cys Val Tyr 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
(E) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

He His Met Arg Met Ser Gin Arg He Ser He His Gin Thr Tyr Gin 
15 10 15 

Arg Gin Thr Arg Tyr Lys Thr Gin Glu Lys Val Cys 
20 25 



(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 



Ser Asn Ser Arg Arg Thr Tyr Cys He Tyr Tyr Ser Tyr Gly Phe Tyr 
15 10 15 
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Tyr Asn Cys Phe Arg Tyr Arg Arg 
20 

Cys Lys Ser Cys Leu Gin Leu Lys 
35 40 



Cys Val Cys His Tyr Phe Val Asn 
50 55 



(2) INFORMATION FOR SEQ ID NO: 128: 

( 
( 



Arg Leu Lys Lys Glu lie Ser 
85 

lie Asn Asn Lys Asn lie Ser 
100 

Ser Cys Phe Leu Lys lie Ser 
115 

lie Lys lie His lie Leu 
130 

(2) INFORMATION FOR SEQ ID NO: 129: 



Cys Thr Pro Glu Ser Cys Asp Asn 
25 30 

Glu Ser Gin Phe Cys Lys Phe Cys 
45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

Li) MOLECULE TYPE: peptide 



<i) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

3er Gin He Ser Tyr Leu Asn Leu Met Asp Ser Tyr Arg Asn Lys Pro 
L 5 10 15 

^sn Lys Pro Cys Lys Phe Asn Gly He Tyr Val Lys Ser Phe Gly Thr 
20 25 30 

^sn Ala His Cys He Tyr He Gly Phe Leu Lys His Arg Tyr Thr Glu 
35 40 45 

Zys Phe Arg Asp Cys Phe Ser Leu Gin Gin He Thr Cys Phe Asp Tyr 
50 L 55 60 

Ser Cys Ser Ser Leu lie Ser Leu Lys Glu Ala Gly Glu Met Lys Arg 
65 70 75 ^ 30 

Lys Phe Val Asp Ser Ser Val Thr Gly 
90 95 

Asn Glu Lys Glu Glu Glu Leu Ser Gin 
105 110 

Lys He Pro Gly Lys Arg Asp Thr Phe 
120 125 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

{ D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

Phe Phe He Ser Gin Leu Leu Phe Ser Phe He Leu Thr He Phe Phe 
15 10 15 

Asp 



{2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:130: 

Leu Glu Val Lys Ser He Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

Thr Glu Val Thr 
1 



(2) INFORMATION FOR SEQ ID MO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

Leu He His He His Arg Ser Thr Phe He Tyr Pro He Arg Cys 
15 10 15 
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(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:133: 

Gly Asn Ser Ser His Pro Phe 
1 5 



(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION-: SEQ ID NO: 134: 

Cys Tyr Glu Asp 
1 



;2; INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 25 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

lie Phe Arg Val Lys Lys Trp Ser Arg Asn Leu Asn Gin Lys Glu Leu 
1 5 ' 10 15 

Arg Arg Tyr Cys Lys Arg lie Glu Leu 
20 25 



(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

Val Leu Pro lie Leu lie Asp Cys Arg Asp Cys Arg Gly Asn Cys Thr 
15 10 15 

Glu Asp His 



(2) INFORMATION FOR SEQ ID NO:137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

Arg Asn Lys Val Thr Phe lie Asn 
1 5 



(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 13 ammo acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

ID) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO; 138: 

lie Thr Asn lie Glu lie Ser Asp Leu Gin Leu Thr- Lys 
1 5 10 



(2) INFORMATION FOR SEQ ID MO: 139: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 



Thr lie Lys Asn Thr Asn Leu Gly Gin Asn He Glu Glu Gly Lys Glu 
15 10 15 
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Asp Gin Leu Ala Lys Glu Lys lie Arg Gin 
20 25 



(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

lie Lys Cys Val Gin Lys Cys Arg Asn Lys Arg Phe lie Phe Phe Asn 
15 10 15 

Asn Leu Leu Lys Arg Gly Val Leu Gly Phe Trp Gly Phe Gly 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 14 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



SEQUENCE DESCRIPTION: SEQ ID MO: 14 1: 

Pro Lys Thr Pro Lys Pro Gin Asn Pro Tyr Lys Lys Arg Lys Asn Cys 
1 5 10 15 

Gly Ser Leu Glu lie Lys Tyr Tyr Ser Arg Thr Asn Gly Asp Gly Tyr 
20 25 30^ 

Cys Phe Gly Cys Tyr Arg Lys Phe Thr Ser 
35 40 



(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

Tyr He Gin Gin Val 
1 5 

(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

Cys Gin Glu Arg Met Gin Asn He Glu He Trp Leu Glu He Ala Phe 
1 5 10 15 

He Asd Tyr Ser Lys Val Ala Lys Thr He Arg Val Leu Leu Leu Gly 
20 25 30 

Cys Lys Ser Leu 
35 



(2) INFORMATION FOR SEQ ID NO: 14 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(3) TYPE: amino acid 

(C) STRANDEDNESS : 

{ D ) TOPOLOGY: linear 

(ii.) MOLECULE TYPE: peptide 



(>:i) SEQUENCE DESCRIPTION: SEQ IDNO:144: 

Arg Phe Phe Leu Glu Lys He Ser Phe Lys Lys Arg Arg Ala Lys Ser 
1 5 '10 15 

Arg Asn Cys Asn lie Thr Asn Val 
20 



(2) INFORMATION FOR SEQ ID NO: 14 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 

lie Lys Ser Gly Asn Glu Asp Tyr Ser lie Phe 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

Gly Ala Leu Trp Arg Lys Leu Leu Asn Thr Lys Arg 
15 10 



(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 108 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(:-:i) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

Thr Val Trp lie lie Ser Leu Ala Asn Asn Asp Glu Tyr lie Lys Phe 
1 '5 10 " ' 15 

lie Cys Glu Cys Val Lys Gly Ser Arg Tyr He Arg Leu Thr Lys Asp 
20 25 30 

Lys Leu. Ala He Lys Arg Lys Lys Lys Phe Asp Asn Arg Thr Ala Glu 
35 4 0 4 5 

Glu Leu He Ala Phe Thr He Arg Met Gly Phe He Thr He Val Leu 

50 55 60 

Gly He Asp Gly Glu Leu Pro Ser Leu Glu Thr He Glu Lys Ala Val 
65 70 75 80 

Tyr Asn Cys Arg Asn Arg Ser Ser Glu Ser Ser Asp Val Tyr Ala He 
85 90 95 



He Leu Cys He Asn Leu Lys Tyr Leu He Ser He 
100 105 



106 



(2) INFORMATION FOR SEQ ID NO: 14 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

Trp He Ala He Glu Thr Asn Gin He Asn His Ala Ser Leu Met Glu 
1 5 10 15 ' 

Tyr Thr Leu Asn Pro Leu Gly Gin Met His Thr Glu Phe lie Leu Asp 
20 25 30 

Ser 



(2) INFORMATION FOR SEQ ID NO: 14 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(:-:i) SEQUENCE DESCRIPTION: SEQ ID MO: 149: 

Ser Lie Asp Thr Gin Asn Ala Leu Glu Thr Asp Leu Ala- Tyr Asn Arg 
1 5 10 15 

Leu Pro Val Leu He Thr Leu Ala His Leu Leu Tyr Leu 
20 25 



(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 



Lys Lys Gin Ala Lys Cys Lys Glu Asp 
1 * 5 
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(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

Arg Lys Arg Phe Gin Asn Leu Leu lie Leu Leu 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
CO STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 152: 

Pro Giu Leu Thr Thr Arg lie Leu Ala Thr Lys Lys Lys Lys Ser Tyr 
1 5 10 15 

His Asn Pro Asp Ser 
20* 



[2] INFORMATION FOR SEQ ID MO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 

Arg Phe Gin Lys Phe Gin Val Arg Glu lie His Ser Leu Lys Phe lie 
15 10 15 

Tyr Tyr Ser Phe Ser Phe His Ser Cys Tyr Phe Leu Leu Ser 
20 25 30 
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(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

Gin Tyr Phe Leu lie Ser Trp Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 155: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

■(ii) MOLECULE TYPE: peptide 



{xi) SEQUENCE DESCRIPTION: SEQ ID MO: 155: 

Lys Val Ser Asn Lys Arg Ser Ala Arg Leu Arg 
1 5 10 



INFORMATION FOR SEQ ID NO: 156: 

(i. ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 87 ammo acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 156: 

Leu Ser Leu Phe Thr Phe He Asp Arg Pro Ser Tyr He Gin Tyr Asp 
15 10 15 

Asp Lys Glu Thr Ala Val He Arg Phe Lys Asn Ser Ala Met Arg Thr 
20 25 30 

Lys Phe Leu Glu Ser Arg Asn Gly Ala Glu He Leu He Lys Lys Asn 
35 40 45 

Cys Val Asp He Ala Lys Glu Ser Asn Ser Lys Ser Phe Val Asn Lys 
50 ' 55 60 



Tyr Tyr Gin Ser Cys Leu He Glu Glu He Asp Glu Ala Thr Ala Gin 
65 70 75^ 80 
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Lys lie lie Lys Glu lie Lys 
85 



(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

Leu Leu Leu lie Arg Glu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



{:<i) SEQUENCE DESCRIPTION: SEQ iD NO: 158: 

Thr L--/3 Leu Leu lie 
1 * 5 



(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(:-:i) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

Arg Ser Ala He Phe Asn Cys Arg Asn Lys Ser Cys Thr Lys Val Arg 



5 



10 



15 



Gin 
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(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

Lys lie Gin Thr Leu Val Lys lie Leu Arg Lys Glu Lys Lys Thr Ser 
1 5 10 • 15 



(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

Gin Lys Lys Lys 



(2) INFORMATION FOR SEQ ID NO: 162: 

" (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 29 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

Asn Glu Tyr Arg Ser Glu Glu lie Lys Asp Leu Phe Phe Ser He He 
1 5 10 15 

Tyr Cys Lys Glu Gly Phe Trp Gly Phe Gly Val Leu Gly 
20 25 



(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 



Pro Lys Pro Gin Asn Pro Lys Thr Pro He Lys Lys Glu Lys He Glu 
15 10 15 



Val Val 



(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

. (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164 : 

Asn lie He Pro Ala Gin Met Glu Met Asp He Asp Leu Asp Asp He 
15 10 15 

Glu Asn Leu Leu Pro Asn Thr Phe Asn Lys Tyr Ser Ser Ser Cys Ser 
20 25 30 

Asp Lys Lys Gly Cys Lys Thr Leu Lys Ser Gly Ser Lys Ser Pro Ser 

35 40 45 

Leu Thr He Pro Lys Leu Gin Lys Gin Leu Glu Phe Tyr Phe Ser Asp 

5 0 5 5 60 

Ala Asn Leu Tyr Asn Asp Ser Phe Leu Arg Lys Leu Val Leu Lys Ser 
65 ' 70 75 30 

Gly Glu Gin Arg Val Glu He Glu Thr Leu Leu Met Phe Lys 
35 90 



(2) INFORMATION FOR SEQ ID MO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 

Asn Gin Val Met Arg He lie Leu Phe Phe Arg Ser Leu Leu Lys Glu 
1 5 10 15 
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His Tyr Gly Glu Asn Tyr Leu He Leu Lys Gly Lys Gin Phe Gly Leu 
20 25 30 

Phe Pro 



(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( p) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

Pro Thr Met Met Ser He Leu Asn Ser Tyr Glu Asn Glu Ser Lys Asp 
15 10 15 

Leu Aso Thr Ser Asp Leu Pro Lys Thr Asn Ser Leu 
20 25 



(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

( i i ) KG L EC U L E TYPE: peptide 



(:-:i) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

Asn Ala Arg Lys Ser Leu He He Glu Gin Gin Lys Asn Leu Leu His 
1 ' 5 10 15 

Leu Leu Phe Val Trp Val Leu Leu Gin Leu Phe 
20 25 



(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 

Val Ser Thr Val Asn Ser Arg Val Leu Arg Gin Leu Lys Lys Leu Phe 
1 5 10 .15 
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Thr Thr Glu Gly lie Ala Val Leu Lys Val Leu Met Cys Met Pro Leu 

20 25 30 

Phe Cys Glu Leu lie Ser Asn He Leu Ser Gin Phe Asn Gly 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 169: 

Lys Gin Thr Lys 
1 



(2) INFORMATION FOR SEQ ID NO: 170: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

Thr Met Gin Vai 
1 



(2) INFORMATION FOR SEQ ID MO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
CO STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 171: 

Trp Asn lie Arg 
1 



114 



(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 

lie Leu Trp Asp Lys Cys Thr Leu Asn Leu Tyr Trp lie Leu Lys Ala 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 

lie His Arg Met Leu 

1 5 



[2] IN FORMATION FOR SEQ ID NO: 17 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 2 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 4: 

Leu Thr Thr Asp Tyr Leu Phe Cys Leu Leu Leu Leu lie Ser Tyr lie 
15 10 15 

Phe Lys Arg Ser Arg Arg Asn Glu Lys Lys Thr Lys Glu Arg Asp Phe 
20 25 30 



Lys lie Cys Cys Phe. Phe Cys Asn Arg Asn 
35 40 
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(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

Gin Gin Glu Tyr 

1 



(2) INFORMATION FOR SEQ ID NO: 17 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
{C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 176: 

Gin Arg Lys Arg Arg Arg Ala lie Thr lie Leu lie Leu Lys Asp Phe 
1 5 10 15 

Lys Asn Ser Arg 
20 



(2) INFORMATION FOR SEQ ID MO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
'(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 177: 

Glu Arg Tyr He His 
1 5 



(2) INFORMATION FOR SEQ ID MO: 17 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 

Asn Ser Tyr lie He Val Phe His Phe Thr Ala Val He Phe Phe Tyr 
15 10 15 

Leu Asn Asn lie Phe Cys Leu Ala Gly Ser Lys Lys Tyr Gin He Arg 
20 25 30 

Glu Ala Leu Asp Cys Gly Asn Leu Ala Tyr Ser His Ser 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 179: 

lie Asp Leu His He Ser Asn Thr Met He Arg Lys Gin Gin Ser Ser 
I 5 10 15 

Val Leu Lys lie Val Leu Cys Gly Leu Asn Phe 
20 25 



INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 amino acids 
(3) TYPE: amino acid 
(C; STRANDEDNESS: 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

Ser Gin Glu Met Glu Pro Lys Ser 
1 5 



2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 1 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

Ser Lys Arg lie Ala Ser lie Leu Gin Lys Asn Arg Thr Leu Asn Leu 

1 5 10 15 

Ser Leu lie Ser lie Thr Asn Leu Asp Cys Leu Lys Arg Leu Thr Arg 

20 25 30 

Gin Leu His Arg Arg Ser Leu Lys Lys 
35 40 



(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 

Ser Asn Phe Tyr 
1 



(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
(3) TYPE: amino acid 
{C) STRANDEDNESS: 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 

Leu Glu Asn Lys Leu Asn Tyr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 



■Tyr Arg Asp Gin Arg Ser Ser He Asp Glu He Lys Ala Glu Leu Lys 
15 10 15 



Leu Asp Asn Lys Lys Tyr Lys Pro Trp Ser Lys Tyr Cys Gly Arg Lys 
20 25 30 



Arg Arg Pro Val Ser Lys Arg Lys Asn Lys Ala lie Asn Lys Met Ser 
35 40 45 

Thr Glu Val Lys Lys 
50 



(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:185: 

Lys lie Tyr Phe Phe Gin 
1 5 



(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 
(3} TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii; MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 

Phe lie Glu Lys Arg Gly Phe Gly Val Leu Gly Phe Trp 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1007 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 



Glu Val Asp Val Asp Asn Gin Ala Asp Asn His Gly He His Ser Ala 
1 ' 5 10 15 
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Leu Lys Thr Cys Glu Glu lie Lys Glu Ala Lys Thr Leu Tyr Ser Trp 
20 25 30 

He Gin Lys Val He Arg Cys Arg Asn Gin Ser Gin Ser His Tyr Lys 
35 40 45 

Asp Leu Glu Asp He Lys He Phe Ala Gin Thr Asn He Val Ala Thr 
50 55 60 

Pro Arg Asp Tyr Asn Glu Glu Asp Phe Lys Val He Ala Arg Lys Glu 
65 70 75 80 

Val Phe Ser Thr Gly Leu Met He Glu Leu He Asp Lys Cys Leu Val 
85 90 95 

Glu Leu Leu Ser Ser Ser Asp Val Ser Asp Arg Gin Lys Leu Gin Cys 
100 105 110 

Phe Gly Phe Gin Leu Lys Gly Asn Gin Leu Ala Lys Thr His Leu Leu 
115 120 125 

Thr Ala Leu Ser Thr Gin Lys Gin Tyr Phe Phe Gin Asp Glu Trp Asn 
130 135 140 

Gin val Arg Ala Met He Gly Asn Glu Leu Phe Arg His Leu Tyr Thr 
145 150 155 160 

Lys Tyr Leu He Phe Gin Arg Thr Ser Glu Gly Thr Leu Val Gin Phe 
165 170 175 

Cys Gly Asn Asn Val Phe Asp His Leu Lys Val Asn Asp Lys Phe Asp 
180 ^ 135 190 

Lys Lys Gin Lys Gly Gly Ala Ala Asp Met Asn Glu Pro Arg Cys Cys 
195 - 200 205 

Ser Thr Cys Lys Tyr Asn Val Lys Asn Glu Lys Asp His Phe Leu Asn 
210 215 ' 220 

Asn He Asn Val Pro Asn Trp Asn Asn Met Lys Ser Arg Thr Arc lie 
225 230 * 235 240 

Phe Tyr 'Cys Thr His Phe Asn Arg Asn Asn Gin Phe Phe Lys Lys His 
245 250 255 

Glu Phe Val Ser Asn Lys Asn Asn He Ser Ala Met Asp Arg Ala Gin 
260 265 270 

Thr lie Phe Thr Asn He Phe Arg Phe Asn Arg He Arg Lys Lys Leu 
275 230 235 

Lys Asp Lys Val lie Glu Lys He Ala Tyr Met Leu Glu Lys Val Lys 
290 295 300 

Asp Phe Asn Phe Asn Tyr Tyr Leu Thr Lys Ser Cys Pro Leu Pro Glu 
305 310 315 320 



Asn Trp Arg Glu Arg Lys Gin Lys He Glu Asn Leu He Asn Lys Thr 
325 330 335 
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Arg Glu Glu Lys Ser Lys Tyr Tyr Glu Glu Leu Phe Ser Tyr Thr Thr 
340 345 350 

Asp Asn Lys Cys Val Thr Gin Phe He Asn Glu Phe Phe Tyr Asn He 
355 360 365 

Leu Pro Lys Asp Phe Leu Thr Gly Arg Asn Arg Lys Asn Phe Gin Lys 
370 375 380 

Lys Val Lys Lys Tyr Val Glu Leu Asn Lys His Glu Leu He His Lys 
385 390 395 400 

Asn Leu Leu Leu Glu Lys He Asn Thr Arg Glu He Ser Trp Met Gin 
405 410 415 

Val Glu Thr Ser Ala Lys His Phe Tyr Tyr Phe Asp His Glu Asn He 
420 425 430 

Tyr Val Leu Trp Lys Leu Leu Arg Trp He Phe Glu Asp Leu Val Val 
435 440 445 

Ser Leu He Arg Cys Phe Phe Tyr Val Thr Glu Gin Gin Lys Ser Tyr 
450 455 460 

S<*r Lys Thr Tyr Tyr Tyr Arg Lys Asn He Trp Asp Val He Met Lys 
465 470 475 480 

Met Ser He Ala Asp Leu Lys Lys Glu Thr Leu Ala Glu Val Gin Glu 
435 490 495 

Lys Glu Val Glu Glu Trp Lys Lys Ser Leu Gly Phe Ala Pro Gly Lys 
500 505 -510 

Leu Arg Leu He Pro Lys Lys Thr Thr Phe Arg Pro lie Met Thr Phe 
515 ^ 520 525 

Asp. Lys Lys He Val Asn Ser Asp Arg Lys Thr Thr Lys Leu Thr Thr 
530 . 535 540 

Asn Thr Lys Leu Leu Asn Ser His Leu Met Leu Lys Thr Leu Lys Asn 
545 550 555 560 

Arg Met Phe Lys Asp Pro Phe Gly Phe Ala Val Phe Asn Tyr Asp Asp 
565 570 575 

Val Met Lys Lys Tyr Glu Glu Phe Val Cys Lys Trp Lys Gin Val Gly 
580 585 590 



Gin Pro Lys Leu Phe Phe Ala Thr Met Asp He Glu Lys Cys Tyr Asp 

595 600 605 

Ser Val Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys Thr Thr Lys Leu 

610 ' 615 620 

L<=u Se*- Ser Asp Phe Trp He Met Thr Ala Gin He Leu Lys Arg Lys 

625 630 635 640 

Asn Asn He Val He Asp Ser Lys Asn Phe Arg Lys Lys Glu Met Lys 

645 650 655 
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Asp Tyr Phe Arg Gin Lys Phe Gin Lys He Ala Leu Glu Gly Gly Gin 
660 665 670 

Tyr Pro Thr Leu Phe Ser Val Leu Glu Asn Glu Gin Asn Asp Leu Asn 
675 680 685 

Ala Lys Lys Thr Leu He Val Glu Ala Lys Gin Arg Asn Tyr Phe Lys 
690 695 700 

Lys Asp Asn Leu Leu Gin Pro Val He Asn lie Cys Gin Tyr Asn Tyr 
705 710 715 720 

He Asn Phe Asn Gly Lys Phe Tyr Lys Gin Thr Lys Gly He Pro Gin 
725 730 • 735 

Gly Leu Cys Val Ser Ser He Leu Ser Ser Phe Tyr Tyr Ala Thr Leu 
740 745 750 

Glu Glu Ser Ser Leu Gly Phe Leu Arg Asp Glu Ser Met Asn Pro Glu 
755 760 "765 

Asn P^o Asn Val Asn Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu 
770 775 780 

He Thr Thr Gin Glu Asn Asn Ala Val Leu Phe lie Glu Lys Leu He 
785 790 795 800 

Asn Val Ser Arg Glu Asn Gly Phe Lys Phe Asn Met Lys Lys Leu Gin 
805 810 815 

Thr Ser Phe Pro Leu Ser Pro Ser Lys Phe Ala Lys Tyr Gly Met Asp 
820 325 830 

Ser Val Glu Glu Gin Asn He Val Gin Asp Tyr Cys Asp Trp lie Gly 
335 340 S-15 

lie Ser He Asp Met Lys Thr Leu Ala Leu Met Pro Asn lie Asn Leu 
850 * 855 860 

Arg He Glu Gly lie Leu Cys Thr Leu Asn Leu Asn Met Gin Thr Lys 
865 870 ' ' 875 880 

Lys Ala Ser Met Tro Leu Lys Lys Lys Leu Lys Ser Phe Leu Met Asn 
835 890 895 

Asn He Thr His Tyr Phe Arg Lys Thr He Thr Thr Glu Asp Phe Ala 
900 905 910 

Asn Lys Thr Leu Asn Lys Leu Phe lie Ser Gly Gly Tyr Lys Tyr Met 
915 920 925 

Gin Cvs Ala Lys Glu Tyr Lys Asp His Phe Lys Lys Asn Leu Ala Met 
930 " 935 940 

Ser Ser Met lie Asp Leu Glu Val Ser Lys He lie Tyr Ser Val Thr 
945 950 955 960 

Arg Ala Phe Phe Lys Tyr Leu Val Cys Asn lie Lys Asp Thr lie Phe 
965 . 970 975 
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Gly Glu Glu His Tyr Pro Asp Phe Phe Leu Ser Thr Leu Lys His Phe 
980 985 990 

lie Glu lie Phe Ser Thr Lys Lys Tyr lie Phe Asn Arg Val Cys 

995 1000 1005 



(2) INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 699 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

Glu Leu Glu Leu Glu Met Gin Glu Asn Gin Asn Asp He Gin Val Arg 
15 10 15 

Val Lys He Asp Asp Pro Lys Gin Tyr Leu Val Asn Val Thr Ala Ala 
20 25 30 

Cys Leu Leu Gin Glu Gly Ser Tyr Tyr Gin Asp Lys Asp Glu Arg Arg 
35 40 45 

Tyr He He Thr Lys Ala Leu Leu Glu Val Ala Glu Ser Asp Pro Glu 
50 55 60 

Phe He Cys Gin Leu Ala Val Tyr lie Arg Asn Glu Leu Tyr lie Arg 
65 70 75 30 

Thr Thr Thr Asn Tyr He Val Ala Phe Cys Val Val His Lys Asn Thr 
35 90 95 

Gin Pro Phe He Glu Lys Tyr Phe Asn Lys Ala Val Leu Leu Pro Asn 
100 105 110 

Asp Leu Leu Glu Val Cys Glu Phe Ala Gin Val Leu Tyr He Phe Asp 
115 120 125 

Ala Thr Glu Phe Lys Asn Leu Tyr Leu Asp Arg He Leu Ser Gin Asp 
130 135 140 

lie Arg Lys Glu Leu Thr Phe Arg Lys Cys Leu Gin Arg Cys Val Arg 
145 150 155 160 

Ser Lys Phe Ser Glu Phe Asn Glu Tyr Gin Leu Gly Lys Tyr Cys Thr 
165 170 175 

Glu Ser Gin Arg Lys Lys Thr Met Phe Arg Tyr Leu Ser Val Thr Asn 
180- 185 190 

Lys Gin Lys Trp Asp Gin Thr Lys Lys Lys Arg Lys Glu Asn Leu Leu 
195 200 205 



Thr Lys Leu Gin Ala He Lys Glu Ser Glu Asp Lys Ser Lys Arg Glu 
210 215 220 
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Thr Gly Asp He Met Asn Val Glu Asp Ala He Lys Ala Leu Lys Pro 
225 230 235 240 

Ala Val Met Lys Lys He Ala Lys Arg Gin Asn Ala Met Lys Lys His 
245 250 255 

M^t Lys Ala Pro Lys He Pro Asn Ser Thr Leu Glu Ser Lys Tyr Leu 
260 265 270 

Thr Phe Lys Asp Leu He Lys Phe Cys His lie Ser Glu Pro Lys Glu 
275 280 285 

Arg Val Tyr Lys lie Leu Gly Lys Lys Tyr Pro Lys Thr Glu Glu Glu 
290 295 300 

Tyr Lys Ala Ala Phe Gly Asp Ser Ala Ser Ala Pro Phe Asn Pro Glu 
305 310 315 320 

Lp U Ala Gly Lys Arg Met Lys He Glu lie Ser Lys Thr Trp Glu Asn 
325 330 335 

Glu Leu Ser Ala Lys Gly Asn Thr Ala Glu Val Trp Asp Asn Leu lie 
340 345 350 

Ser Ser Asn Gin Leu Pro Tyr Met Ala Met Leu Arg Asn Leu Ser Asn 
355 360 365 

Tie Leu Lys Ala Gly Val Ser Asp Thr Thr His Ser lie Val lie Asn 
370 375 380 

Lvs He Cys Glu Pro Lys Ala Val Glu Asn Ser Lys Met Phe Pro Leu 
3S5 390 395 400 

Gin Phe Phe Ser Ala He Glu Ala Val Asn Glu Ala Val Thr Lys Gly 
405 410 415 

Phe Lvs Ala Lys Lys Arg Glu Asn Met Asn Leu Lys Gly Gin He Glu 
4 2 0 ^25 430 

Ala Val Lys Glu Val Val Glu Lys Thr Asp Glu Glu Lys Lys Asp Met 
435 44 0 445 

Glu Leu Glu Gin Thr Glu Glu Gly Glu Phe Val Lys Val Asn Glu Gly 
450 455 460 

Hp Glv Lys Gin Tyr He Asn Ser lie Glu Leu Ala lie Lys He Ala 
465 ' 470 - 475 480 

Val Asn Lys Asn Leu Asp Glu He Lys Gly His Thr Ala lie Phe Ser 
485 490 495 

Asp Val Ser Gly Ser Met Ser Thr Ser Met Ser Gly Gly Ala Lys Lys 
500 505 510 

r vr c-'V Ser Val Arg Thr Cys Leu Glu Cys Ala Leu Val Leu Gly Leu 
515 * 520 525 

Met Val Lys Gin Arg Cys Glu Lys Ser Ser Phe Tyr lie Phe Ser Ser 
530 535 540 ' 
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Pro Ser Ser Gin Cys Asn Lys Cys Tyr Leu Glu Val Asp Leu Pro Gly 
545 550 555 560 

Asp Glu Leu Arg Pro Ser Met Gin Lys Leu Leu Gin Glu Lys Gly Lys 
565 570 575 

Leu Gly Gly Gly Thr Asp Phe Pro Tyr Glu Cys lie Asp Glu Trp Thr 
580 585 590 

Lys Asn Lys Thr His Val Asp Asn lie Val lie Leu Ser Asp Met Met 
595 600 605 

lie Ala Glu Gly Tyr Ser Asp lie Asn Val Arg Gly Ser Ser lie Val 
610 615 620 

Asn Ser lie Lys Lys Tyr Lys Asp Glu Val Asn Pro Asn lie Lys lie 
625 . 630 635 640 

Phe Ala Val Asp Leu Glu Gly Tyr Gly Lys Cys Leu Asn Leu Gly Asp 
645 650 655 

Glu Phe Asn Glu Asn Asn Tyr lie Lys lie Phe Gly Met Ser Asp Ser 
660 665 670 

He Leu Lys Phe He Ser Ala Lys Gin Gly Gly Ala Asn Met Val Glu 
675 680 635 



Val He Lys Asn Phe Ala Leu Gin Lys He Gly 
690 695 



(2) INFORMATION FOR SEQ ID NO: 18 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 897 amino acids 
{3} TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 189: 

Leu Ser Thr Gin Lys Gin Tyr Phe Phe Gin Asp Glu Trp Asn Gin Val 
1 5 10 15 

Arg Ala Met He Gly Asn Glu Leu Phe Arg His Leu Tyr Thr Lys Tyr 
20 25 30 

Leu lie Phe Gin Arg Thr Ser Glu Gly Thr Leu Val Gin Phe Cys Gly 
35 40 45 

Asn Asn Val Phe Asp His Leu Lys Val Asn Asp Lys Phe Asp Lys Lys 
50 * 55 60 

Gin Lys Gly Gly Ala Ala Asp Met Asn Glu Pro Arg Cys Cys Ser Thr 

65 70 75 80 

Cys Lys Tyr Asn Val Lys Asn Glu Lys Asp His Phe Leu Asn Asn He 
85 ' 90 95 



Asn Val Pro Asn Trp Asn Asn Met Lys Ser Arg Thr Arg lie Phe Tyr 
100 105 110 



Cys Thr His Phe Asn Arg Asn Asn Gin Phe Phe Lys Lys His Glu Phe 
115 120 125 

Val Ser Asn Lys Asn Asn lie Ser Ala Met Asp Arg Ala Gin Thr lie 
130 135 140 

Phe Thr Asn lie Phe Arg Phe Asn Arg lie Arg Lys Lys Leu Lys Asp 
145 150 155 160 

Lys Val lie Glu Lys He Ala Tyr Met Leu Glu Lys Val Lys Asp Phe 
165 170 175 

Asn Phe Asn Tyr Tyr Leu Thr Lys Ser Cys Pro Leu Pro Glu Asn Trp 
180 185 190 

Arg Glu Arg Lys Gin Lys He Glu Asn Leu He Asn Lys Thr Arg Glu 
195 200 205 

Glu Lys Ser Lys Tyr Tyr Glu Glu Leu Phe Ser Tyr Thr Thr Asp Asn 
210 215 220 

Lys Cys Val Thr Gin Phe He Asn Glu Phe Phe Tyr Asn He Leu Pro 
225 230 235 240 

Lys Asp Phe Leu Thr Gly Arg Asn Arg Lys Asn Phe Gin Lys Lys Val 
245 250 255 

Lys Lys Tyr Val Glu Leu Asn Lys His Glu Leu He His Lys Asn Leu 
260 265 270 

Leu Leu Glu Lys He Asn Thr Arg Giu He Ser Trp Met Gin Val Glu 

275 .230 235 

Thr Ser Ala Lys His Phe Tyr Tyr Phe Asp His Glu Asn lie Tyr Val 
290 * 295 300 

Leu Trp Lys Leu Leu Arg Trp lie Phe Glu Asp Leu Val Val Ser Leu 
305 310 " 315 320 

lie Arg Cys Phe Phe Tyr Val Thr Giu Gin Gin Lys Ser Tyr Ser Lys 

325 330 335 

v 

Thr Tyr Tyr Tyr Arg Lys Asn lie Trp Asp Val He Met Lys Met Ser 
340 345 350 

lie Ala Asp Leu Lys Lys Glu Thr Leu Ala Glu Val Gin Glu Lys Glu 
355 360 365 

Val Glu Glu Trp Lys Lys Ser Leu Gly Phe Ala Pro Gly Lys Leu Arg 
370 375 380 



Leu lie Pro Lys Lys Thr Thr Phe Arg Pro lie Met Thr Phe Asn Lys 
335 390 395 400 



Lys He Val Asn Ser Asp Arg Lys Thr Thr Lys Leu Thr Thr Asn Thr 
405 410 415 
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Lys Leu Leu Asn Ser His Leu Met Leu Lys Thr Leu Lys Asn Arg Met 
420 425 430 

Phe Lys Asp Pro Phe Gly Phe Ala Val Phe Asn Tyr Asp Asp Val Met 
435 440 445 

Lys Lys Tyr Glu Glu Phe Val Cys Lys Trp Lys Gin Val Gly Gin Pro 
450 455 460 

Lys Leu Phe Phe Ala Thr Met Asp He Glu Lys Cys Tyr Asp Ser Val 
465 470 475 480 

Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys Thr Thr Lys Leu Leu Ser 
485 490 495 

Ser Asp Phe Trp He Met Thr Ala Gin He Leu Lys Arg Lys Asn Asn 
500 505 510 

He Val He Asp Ser Lys Asn Phe Arg Lys Lys Glu Met Lys Asp Tyr 
515 520 525 

Phe Arg Gin Lys Phe Gin Lys lie Ala Leu Glu Gly Gly Gin Tyr Pro 
530 535 540 

Thr Leu Phe Ser Val Leu Glu Asn Glu Gin Asn Asp Leu Asn Ala Lys 
545 550 555 560 

Lys Thr Leu He Val Glu Ala Lys Gin Arg Asn Tyr Phe Lys Lys Asp 
565 570 575 

Asn Leu Leu Gin Pro Val He Asn He Cys Gin Tyr Asn Tyr He Asn 
580 585 590 

Phe Asn Gly Lys Phe Tyr Lys Gin Thr Lys Gly He Pro Gin Gly Leu 
595 600 605 

Cvs Val Ser Ser He Leu Ser Ser Phe Tyr Tyr Ala Thr Leu Glu Glu 
610 615 620 

Ser Ser Leu Gly Phe Leu Arg Asp Glu Ser Met Asn Pro Glu Asn Pro 
625 630 635 640 

Asn Val Asn Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu lie Thr 
645 650 655 

Thr Gin Glu Asn Asn Ala Val Leu Phe He Glu Lys Leu He Asn Val 
660 665 670 

Ser Arg Glu Asn Gly Phe Lys Phe Asn Met Lys Lys Leu Gin Thr Ser 
675 680 685 

Phe Pro Leu Ser Pro Ser Lys Phe Ala Lys Tyr Gly Met Asp Ser Val 
690 695 700 



Glu Glu Gin Asn He Val Gin Asp Tyr Cys Asp Trp lie Gly lie Ser 
705 710 715 720 



He Asp Met Lys Thr Leu Ala Leu Met Pro Asn lie Asn Leu Arc He 
725 730 735 
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Glu Gly lie Leu Cys Thr Leu Asn Leu Asn Met Gin Thr Lys Lys Ala 
740 745 750 

Ser Met Trp Leu Lys Lys Lys Leu Lys Ser Phe Leu Met Asn Asn He 
755 760 765 

Thr His Tyr Phe Arg Lys Thr He Thr Thr Glu Asp Phe Ala Asn Lys 
770 775 780 

Thr Leu Asn Lys Leu Phe He Ser Gly Gly Tyr Lys Tyr Met Gin Cys 
785 790 795 800 

Ala Lys Glu Tyr Lys Asp His Phe Lys Lys Asn Leu Ala Met Ser Ser 
805 810 815 

Met He Asp Leu Glu Val Ser Lys lie He Tyr Ser Val Thr Arc Ala 
820 825 830 

Phe Phe Lys Tyr Leu Val Cys Asn He Lys Asp Thr He Phe Gly Glu 
835 840 845 

Glu His Tyr Pro Asp Phe Phe Leu Ser Thr Leu Lys His Phe lie Glu 
850 855 860 

He Phe Ser Thr Lys Lys Tyr He Phe Asn Arg Val Cys Met He Leu 
865 870 875 380 

Lys Ala Lys Glu Ala Lys Leu Lys Ser Asp Gin Cys Gin Ser Leu lie 
885 890 895 

Gin 



(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 840 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDMESS : 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ I 

Met Ser Arg Arg Asn Gin Lys Lys 
1 -5 

Thr Asn Leu Asp Phe VaHLeu Gin 
20 

lie Glu His Tyr Lys Thr Gin Gin 
35 40 

Lys Leu Leu Lys Phe Lys Asn Gin 
50 55 



i NO: 190: 

Pro Gin Ala Pro lie Gly Asn Glu 
10 15 

Asn Leu Glu Val Tyr Lys Ser Gin 
25 30 

Gin Gin He Lys Glu Glu Asp Leu 
45 

Asp Gin Asp Gly Asn Ser Gly Asn 
60 



Asp Asp Asp Asp Glu Glu Asn Asn Ser Asn Lys Gin Gin Glu Leu Leu 
65^ 70 75 30 
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Arg Arg Val Asn Gin He Lys Gin Gin Val Gin Leu He Lys Lys Val 
85 90 95 

Gly Ser Lys Val Glu Lys Asp Leu Asn Leu Asn Glu Asp Glu Asn Lys 
100 105 110 

Lys Asn Gly Leu Ser Glu Gin Gin Val Lys Glu Glu Gin Leu Arg Thr 
115 120 125 

He Thr Glu Glu Gin Val Lys Tyr Gin Asn Leu Val Phe Asn Met Asp 
130 135 140 

Tyr Gin Leu Asp Leu Asn Glu Ser Gly Gly His Arg Arg His Arg Arg 
145 150 155 160 

Glu Thr Asp Tyr Asp Thr Glu Lys Trp Phe Glu He Ser His Asp Gin 
165 170 175 

Lys Asn Tyr Val Ser He Tyr Ala Asn Gin Lys Thr Ser Tyr Cys Trp 
180 185 190 

Trp Leu Lys Asp Tyr Phe Asn Lys Asn Asn Tyr Asp His Leu Asn Val 
195 200 205 

Ser He Asn Arg Leu Glu Thr Glu Ala Glu Phe Tyr Ala Phe Asp Asp 

210 215 220 

Phe Ser Gin Thr He Lys Leu Thr Asn Asn Ser Tyr Gin Thr Val Asn 
225 230 235 240 

He Asp Val Asn Phe Asp Asn Asn Leu Cys He Leu Ala Leu Leu Arg 
245 250 255 

Phe Leu Leu Ser Leu Glu Arg Phe Asn lie Leu Asn lie Arg Ser Ser 
260 265 270 

Tyr Thr Arg Asn Gin Tyr Asn Phe Glu Lys lie Gly Giu Leu Leu Glu 
275 230 235 

Thr lie Phe Ala Val Val Phe Ser His Arg His Leu Gin Gly lie' His 
290 295 300 

Leu Gin Val Pro Cys Glu Ala Phe Gin Tyr Leu Val Asn Ser Ser Ser 
305 310 315 320 

Gin He Ser Val Lys Asp Ser Gin Leu Gin Val Tyr Ser Phe Ser Thr 
325 330 335 

Asp Leu Lys Leu Val Asp Thr Asn Lys Val Gin Asp Tyr Phe Lys Phe 
340 345 350 

Leu Gin Glu Phe Pro Arg Leu Thr His Val Ser Gin Gin Ala lie Pro 
355 360 365 

Val Ser Ala Thr Asn Ala Val Glu Asn Leu Asn Val Leu Leu Lys Lys 
370 375 380 



Val Lys His Ala Asn Leu Asn Leu Val Ser He Pro Thr Gin Phe Asn 
335 390 395 400 
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Phe Asp Phe Tyr Phe Val Asn Leu Gin His Leu Lys Leu Glu Phe Gly 
405 410 415 

Leu Glu Pro Asn He Leu Thr Lys Gin Lys Leu Glu Asn Leu Leu Leu 
420 425 430 

Ser He Lys Gin Ser Lys Asn Leu Lys Phe Leu Arg Leu Asn Phe Tyr 
435 440 445 

Thr Tyr Val Ala Gin Glu Thr Ser Arg Lys Gin He Leu Lys Gin Ala 
450 455 460 

Thr Thr lie Lys Asn Leu Lys Asn Asn Lys Asn Gin Glu Glu Thr Pro 
465 470 475 480 

Glu Thr Lys Asp Glu Thr Pro Ser Glu Ser Thr Ser Gly Met Lys Phe 
485 490 495 

Phe Asp His Leu Ser Glu Leu Thr Glu Leu Glu Asp Phe Ser Val Asn 
500 505 510 

Leu Gin Ala Thr Gin Glu He Tyr Asp Ser Leu His Lys Leu Leu lie 
515 520 525 

Arg Ser Thr Asn Leu Lys Lys Phe Lys Leu Ser Tyr Lys Tyr Glu Met 
530 535 540 

Glu Lvs Ser Lys Met Asp Thr Phe He Asp Leu Lys Asn He Tyr Glu 
545 ' 550 555 560 

Thr Leu Asn Asn Leu Lys Arg Cys Ser Val Asn He Ser Asn Pro His 
565 570 575 

Glv Asn lie Ser Tyr Glu Leu Thr Asn Lys Asp Ser Thr Phe Tyr Lys 
580 535 590 

?he Lvs Leu Thr Leu Asn Gin Glu Leu Gin His Ala Lys Tyr Thr Phe 
595 600 605 

Lys Gin Asn Giu Phe Gin Phe Asn Asn Val Lys Ser Ala Lys He Glu 
610 615 620 

Ser Ser Ser Leu Glu Ser Leu Glu Asp He Asp Ser Leu Cys Lys Ser 
625 630 635 640 

He Ala Ser Cys Lys Asn Leu Gin Asn Val Asn He He Ala Ser Leu 
645 650 655 

Leu Tyr Pro Asn Asn He Gin Lys Asn Pro Phe Asn Lys Pro Asn Leu 
660 665 670 

Leu Phe Phe Lys Gin Phe Glu Gin Leu Lys Asn Leu Glu Asn Val Ser 
675 680 685 

lie Asn Cys He Leu Asp Gin His He Leu Asn Ser He Ser Glu Phe 
690 695 700 



Leu Glu Lys Asn Lys Lys He Lys Ala Phe He Leu Lys Arg Tyr Tyr 
705 710 715 720 
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Leu Leu Gin Tyr Tyr Leu Asp Tyr Thr Lys Leu Phe Lys Thr Leu Gin 
725 730 735 

Gin Leu Pro Glu Leu Asn Gin Val Tyr lie Asn Gin Gin Leu Glu Glu 
740 745 750 

Leu Thr Val Ser Glu Val His Lys Gin Val Trp Glu Asn His Lys Gin 
755 760 765 

Lys Ala Phe Tyr Glu Pro Leu Cys Glu Phe lie Lys Glu Ser Ser Gin 
770 775 780 

Thr Leu Gin Leu lie Asp Phe Asp Gin Asn Thr Val Ser Asp Asp Ser 
785 790 795 800 

lie Lys Lys He Leu Glu Ser He Ser Glu Ser Lys Tyr His His Tyr 
805 810 815 

Leu Arg Leu Asn Pro Ser Gin Ser Ser Ser Leu He Lys Ser Glu Asn 
820 825 830 

Glu Glu He Gin Glu Leu Leu Lys 
835 840 



[2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83^amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(n) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ I! 

Asp lie Asp Leu Asp Asp lie Glu 
i ^5 

Lys Tyr Ser Ser Ser Cys Ser Asp 
20 

Ser Gly Ser Lys Ser Pro Ser Leu 
35 40 

Leu Glu Phe Tyr Phe Ser Asp Ala 
50 55 

Arg Lys Leu Val Leu Lys Ser Gly 
65 70 



> NO: 191: 

Asn Leu Leu Pro Asn Thr Phe Asn 
10 * 15 

Lys Lys Gly Cys Lys Thr Leu Lys 
25 30 

Thr He Pro Lys Leu Gin Lys Gin 

4 5 

Asn Leu Tyr Asn Asp Ser Phe Leu 
60 

Glu Gin. Arg Val Glu He Glu Thr 
75 80 



Leu 



Leu Met 



(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

Asn Val Lys Ser Ala Lys lie Glu Ser Ser Ser Leu Glu Ser Leu Giu 
1 5 10 15 

Asp He Asp Ser Leu Cys Lys Ser He Ala Ser Cys Lys Asn Leu Gin 
20 25 30 

Asn Val Asn He He Ala Ser Leu Leu Tyr Pro Asn Asn He Gin Lys 
35 40 45 

Asn Pro Phe Asn Lys Pro Asn Leu Leu Phe Phe Lys Gin Phe Glu Gin 

50 55 60 

Leu Lys Asn Leu Glu Asn Val Ser He Asn Cys He Leu Asp Gin His 
65 70 75 80 

lie Leu Asn Ser He Ser Glu Phe Leu Glu Lys Asn Lys Lys lie Lys 
85 90 95 

Ala Phe He Leu 
100 



(2} INFORMATION FOR SEQ ID MO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



[ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 

Met Glu Met Asp He Asp Leu Asd Asp He Glu Asn Leu Leu Pro Asn 
15 10 15 

Thr Phe Asn Lys Tyr Ser Ser Ser Cys Ser Asp Lys Lys Gly Cys Lys 
20 25 30 

Thr Leu Lys Ser Gly Ser Lys Ser Pro Ser Leu Thr He Pro Lys Leu 
35 40 45 



Gin Lys Gin Leu Glu Phe Tyr Phe Ser Asp Ala Asn Leu Tyr Asn Asp 
50 55 60 
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Ser Phe Leu Arg Lys Leu Val Leu Lys Ser Gly Glu Gin Arg Val Glu 
65 70 75 80 

lie Glu Thr Leu Leu 
85 



(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

He Glu Leu Ala He Lys He Ala Val Asn Lys Asn Leu Asp Glu He 
15 10 15 

Lys Gly His Thr Ala He Phe Ser Asp Val Ser Gly Ser Met Ser Thr 
20 25 30 

Ser Met Ser Gly Gly Ala Lys Lys Tyr Gly Ser Val Arg Thr Cys Leu 
35 40 45 

Glu Cys Ala Leu Val Leu Gly Leu Met Val Lys Gin Arg Cys Glu Lys 
50 55 60 

Ser Ser Phe Tyr lie Phe Ser Ser Pro Ser Ser Gin Cys Asn Lys Cys 
65 70 75 80 

Tyr Leu Glu Val Aso Leu Pro Gly Asp Glu Leu Arg Pro Ser Met Gin 
85* 90 95 

Lys Leu Leu 



(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 13 

(D) OTHER INFORMATION: /note= "consensus telomerase RT sequence 

from motif A" 
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.(ix) FEATURE: 

(A) NAME/KEY: Modif ieci-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /product = "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa = hydrophobic amino acid, . 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 



(ix) FEATURE: . 

(A) NAME/KEY: Modif ied-si te 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /product= "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 



(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-si te 
(3) LOCATION: 10 

(D) OTHER INFORMATION: /product= "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 



(ix) FEATURE : 

(A) NAME /KEY : Modi f ied-s i te 
(3) LOCATION: 13 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa - hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 195: 

Xaa Xa'a Xaa Xaa Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 196: 

(l) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRAMDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 13 

(D) OTHER INFORMATION: /note- "consensus telomerase RT sequence 

from motif B" 



(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-s i te 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala-, Leu, He, Val, Pro, Phe, Trp 
or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

Xaa Xaa Xaa Xaa Xaa Xaa Gin Gly Xaa Xaa Xaa Ser Pro 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1 . . 10 

(D) OTHER INFORMATION: /note= "consensus telomerase RT sequence 

from motif C" 



(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-s i te 
(3) LOCATION: 1 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note- "Xaa = hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, Trp 
or Met" 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /product- "OTHER" 

/note- "Xaa = hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, Trp 
or Met" 



(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-s ite 

(B) LOCATION: 8 . . 10 

(D) OTHER INFORMATION: /product- "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, Trp 
or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 

Xaa Xaa Xaa Tyr Xaa Asp Asp Xaa Xaa Xaa 
15 10 



(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..8 

(D) OTHER INFORMATION: /note= "consensus telomerase RT sequence 

from motif D" 



(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-si te 
(3) LOCATION: 2 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa - hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-s i te 
(3) LOCATION: 4 

( D) OTHER INFORMATION: /product= "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Aia, Leu, lie, Val, Pro, Phe, Trp 
or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 198: 

Gly Xaa Xaa Xaa Xaa Xaa Xaa Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..8 

(D) OTHER INFORMATION: /note* 8 "consensus telomerase RT sequence 

from motif E" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-si te 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 

(ix) FEATURE: 

(A) NAME /KEY: Modi f ied-si te 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /product- "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 

(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note- "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-si te 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /product = "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 199: 

Xaa Xaa Xaa Leu Gly Xaa Xaa Xaa 
1 5 



(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION: 1 . . 34 

(D) OTHER INFORMATION: /note- "motif A peptide from 

Dong (LINE)" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 

Lys Asn Arg Asn Leu His Cys Thr Tyr lie Asp Tyr Lys Lys Ala Phe 
15 10 15 

Asp Ser He Pro His Ser Trp Leu He Gin Val Leu Glu He Tyr Lys 
20 25 ' 30 

He Asn 



(2) INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..35 

(D) OTHER INFORMATION: /note= "motif B peptide from 

Dong (LINE)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 201: 

Arg Gin lie Ala lie Lys Lys Gly lie Tyr Gin Gly Asp Ser Leu Ser 
1 5 10 15 

Pro Leu TrD Phe Cys Leu Ala Leu Asn Pro Leu Ser His Gin Leu His 
20 25 30 

Asn Asp Arg 
35 



(2) INFORMATION FOR SEQ ID MO: 202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 43 

(D) OTHER INFORMATION: /note- "motif C and D peptide from 

Dong (LINE)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:202: 

His Leu lie Tyr Met Asp Asp lie Lys Leu Tyr Ala Lys Asn Asp Lys 
1 5 10 15 

Glu Met Lys Lys Leu lie Asp Thr Thr Thr He Phe Ser Asn Asp He 
20 25 30 

-Ser Met Gin Phe Gly Leu Asp Lys Cys Lys Thr 
35 40 



(2) INFORMATION FOR SEQ ID NO:203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE : 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1..11 

(D) OTHER INFORMATION: /note= "motif E peptide from 

Dong (LINE) " 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:203: 

Lys Cys Leu Tyr Lys Tyr Leu Gly Phe Gin Gin 
1 5 10 



INFORMATION FOR SEQ ID MO: 204: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 34 amino acids 

(3) TYPE: amino acid 

(C) STRANDEDNESS: 

{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..34 

(D) OTHER INFORMATION: /note= "motif A peptide from 

Saccharomyces cerevisiae cytochrome 
oxidase group II intron 1-encoded 
mitochondrial protein" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 



Phe Gly Gly Ser Asn Trp Phe Arg Glu Val Asp Leu Lys Lys Cys Phe 
15 10 15 



Asp Thr He Ser His Asp Leu He He Lys Glu Leu Lys Arg Tyr lie 
20 25 30 



Ser Asp 



(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



fix) FEATURE : 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 35 

(D) OTHER INFORMATION: /note= "motif B peptide from 

Saccharomyces cerevisiae^ cytochrome 
oxidase group II intron 1-encoded 
mitochondrial protein" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 

His Val Pro Val Gly Pro Arg Val Cys Val Gin Gly Ala Pro Thr Ser 
15 10 15 

Pro Ala Leu Cys Asn Ala Val Leu Leu Arg Leu Asp Arg Arg Leu Ala 
20 25 ^ 30 

Gly Leu Ala 
35 



(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..2 5 

(D) OTHER INFORMATION : /note- "motif D peptide from 

Saccharomyces cerevisiae cytochrome 
oxidase group II intron 1-encoded 
mitochondrial protein" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:206: 



Lys lie lie Lys Arg Asp Leu Asn Asn Phe Leu Asn Ser Leu Gly Leu 
1 5 10 15 

Thr lie Asn Glu Glu Lys Thr Leu lie 
20 25 



(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..34 

(D) OTHER INFORMATION: /note= "motif A peptide from HIV RT " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:207: 

Leu Lys Lys Lys Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr 
1 5 10 15 

Phe Ser Val Pro Leu Asp Glu Asp Phe Arg Lys Tyr Thr Ala Phe Thr 
20 25 30 

lie Pro 



(2) INFORMATION FOR SEQ ID NO:208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 35 

(D) OTHER INFORMATION: /note- "motif B peptide from HIV RT 11 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 208: 

Gly He Arg Tyr Gin Tyr Asn Val Leu Pro Gin Gly Trp Lys Gly Ser 
1 5 10 " 15 



Pro Ala He Phe Gin Ser Ser Met Thr Lys He Leu Glu Pro Phe Arg 
20 25 30 
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Lys Gin Asn 
35 



(2) INFORMATION FOR SEQ ID NO:209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..17 

(D) OTHER INFORMATION: /note= "motif C peptide from HIV RT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:209: 

lie Tyr Gin Tyr Met Asp Asp Leu Tyr Val Gly Ser His Leu Glu He 
1 5 * 10 15 

Gly 



(2) INFORMATION FOR SEQ ID NO;210: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 37 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION: 1 . . 37 

(D) OTHER INFORMATION: /note= "motif D and E peptide from 

HIV RT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:210: 

His Arg Thr Lys He Glu Glu Leu Arg Gin His Leu Leu Arg Trp Gly 

1 5 10 15' 

Leu Thr Thr Pro Asp Lys Lys His Gin Lys Glu Pro Pro Phe Leu Trp 
20 25 30 

Met Gly Tyr Glu Leu 
35 
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(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..38 

(D) OTHER INFORMATION: /note- "motif C and D peptide from 

Saccharomyces cerevisiae 
ESTp (L8543.12)". 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:211: 

lie Leu Lys Leu Ala Asp Asp Phe Leu He He Ser Thr Asp Gin Gin 
15 10 15 

Gin Val lie Asn He Lys Lys Leu Ala Met Gly Gly Phe Gin Lys Tyr 
20 25 30 

Asn Ala Lys Ala Asn Arg 
35 



(2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION: 1 . . 11 

(D) OTHER INFORMATION: /note- "motif E peptide from 

Saccharomyces cerevisiae 
ESTp {L8543. 12) " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:212: 

He Arg Ser Lys Ser Ser Lys Gly He Phe Arg 
1 5 10 




INFORMATION FOR SEQ ID NO:213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:213: 

Leu Gin Lys Gin Leu Glu Phe Tyr Phe Ser Asp Ala Asn Leu Tyr Asn 
1 5 10 15 

Asp Ser Phe Leu Arg Lys Leu Val Leu Lys Ser Gly Glu Gin Arg Val 
20 25 30 

Glu He Glu Thr Leu Leu Met 
35 



INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 38 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 214: 

lie Cys His Gin Xaa Glu Tvr Tyr Phe Gly Asp Phe Asn Leu Pro Arg 

1 ' 5 10 15 

Asp Lys Phe Leu Lys Glu Gin He Lys Leu Asp Glu Gly Trp Val Pro 
20 25 30 

Leu Glu He Met He Lys 
35 



INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:215: 

He Cys Glu Gin lie Glu Tyr Tyr Phe Gly Asp His Asn Leu Pro Arg 
1 5 10 15 
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Asp Lys Phe Leu Lys Gin Gin He Leu Leu Asp Asp Gly Trp Val Pro 
20 25 30 

Leu Glu Thr Met He Lys 
35 



INFORMATION FOR SEQ ID NO: 216: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:216: 

He Leu Arg Gin Val Glu Tyr Tyr Phe Gly Asp Ala Asn Leu Asn Arg 
1 5 10 15 

Asp Lys Phe Leu Arg Glu Gin He Gly Lys Asn Glu Asp Gly Trp Val 
20 25 30 

Pro Leu Ser Val Leu Val Thr 
35 



INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(y.i) SEQUENCE DESCRIPTION: SEQ ID HO: 217: 

Cys Leu Lys Gin Val Glu Phe Tyr Phe Ser Glu Phe Asn Phe Pro Tyr 

1 5 10 15 

Aso Arg Phe Leu Arg Thr Thr Ala Glu Lys Asn Asp Gly Trp Val Pro 
20 25 30 

He Ser Thr He Ala Thr 
35 



INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 92,, 2251 

•(D) OTHER INFORMATION: /codpn= (seq: M taa", aa : Gin) 

/codon= (seq: "tag", aa: Gin) 
/codon= (seq: "tga", aa : TRM) 
/product^ "Tetrahymena thermophila 
80 kDa protein subunit" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 

AACTCATTTA ATTACTAATT TAATCAACAA GATTGATAAA AAGCAGTAAA TAAAACCCAA 60 

TAGATTTAAT TTAGAAAGTA TCAATTGAAA A ATG GAA ATT ,GAA AAC AAC TAA 112 

Met Glu He Glu Asn Asn Gin 
1 5 

GCA CAA TAG CCA AAA GCC GAA AAA TTG TGG TGG GAA CTT GAA TTA GAG 160 
Ala Gin Gin Pro Lys Ala Glu Lys Leu Trp Trp Glu Leu Glu Leu Glu 
10 15 20 

ATG CAA GAA AAC CAA AAT GAT ATA TAA GTT AGG GTT AAG ATT GAC GAT 208 
Met Gin Glu Asn Gin Asn Asp He Gin Val Arg Val Lys He Asp Asp 
25 30 35 

CCT AAG CAA TAT CTC GTG AAC GTC ACT GCA GCA TGT TTG TTG TAG GAA 256 
Pro Lys Gin Tyr Leu Val Asn Val Thr Ala Ala Cys Leu Leu Gin Glu 
4 0 4 5 50 55 

GGT AGT TAC TAC TAA GAT AAA GAT GAA AG A AGA TAT ATC ATC ACT AAA 304 
Gly Ser Tyr Tyr Gin Asp Lys Asp Glu Arc Arg Tyr He He Thr Lys 
60 * 65 70 

GCA CTT CTT GAG GTG GCT GAG TCT GAT CCT GAG TTC ATC TGC TAG TTG 352 
Aia Leu Leu Glu Val Ala Glu Ser Asp Pro Glu Phe He Cys Gin Leu 
75 80 85 

GCA GTC TAC ATC CGT AAT GAA CTT TAC ATC AGA ACT ACC ACT AAC TAC 4 00 

Ala Val Tyr He Arg Asn Glu Leu Tyr He Arg Thr Thr Thr Asn Tyr 
90 95 100 

ATT GTA GCA TTT TGT GTT GTC CAC AAG AAT ACT CAA CCA TTC ATC GAA 4 48 

He Val Ala Phe Cys Val Val His Lys Asn Thr Gin Pro Phe He Glu 
105 110 115 

AAG TAC TTC AAC AAA GCA GTA CTT TTG CCT AAT GAC TTA CTG GAA GTC 4 96 

Lys Tyr Phe Asn Lys Ala Val Leu Leu Pro Asn Asp Leu Leu Glu Val 
120 125 130 ^ 135 

TGT GAA TTT GCA. TAG GTT CTC TAT ATT TTT GAT GCA ACT GAA TTC AAA 54 4 

Cys Glu Phe Ala Gin Val Leu Tyr He Phe Asp Ala Thr Glu Phe Lys 
140 145 150 

AAT TTG TAT CTT GAT AGG ATA CTT TCA TAA GAT ATT CGT AAG GAA CTC 592 
Asn Leu Tyr Leu Asp Arg He Leu Ser Gin Asp lie Arg Lys Glu Leu 
155 160 165 
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ACT TTC CGT AAG TGT TTA CAA AG A TGC GTC AGA AGC AAG TTT TCT GAA 64 0 

Thr Phe Arg Lys Cys Leu Gin Arg Cys Val Arg Ser Lys Phe Ser Glu 
170 175 180 

TTC AAC GAA TAC TAA CTT GGT AAG TAT TGC ACT GAA TCC TAA CGT AAG 68 8 

Phe Asn Glu Tyr Gin Leu Gly Lys Tyr Cys Thr Glu Ser Gin Arg Lys 
185 190 195 

AAA ACA ATG TTC CGT TAC CTC TCA GTT ACC AAC AAG TAA AAG TGG GAT 7 36 

Lys Thr Met Phe Arg Tyr Leu Ser Val Thr Asn Lys Gin Lys Trp Asp 
200 205 210 215 

TAA ACT AAG AAG AAG AGA AAA GAG AAT CTC TTA ACC AAA CTT TAG GCA 78 4 

Gin Thr Lys Lys Lys Arg Lys Glu Asn Leu Leu Thr Lys Leu Gin Ala 
220 225 230 

ATA AAG GAA TCT GAA GAT AAG TCC AAG AGA GAA ACT GGA GAC ATA ATG 832 
lie Lys Glu Ser Glu Asp Lys Ser Lys Arg Glu Thr Gly Asp He Met 
235 240 245 

AAC GTT GAA GAT GCA ATC AAG GCT TTA AAA CCA GCA GTT ATG AAG AAA 880 
Asn Val Glu Asp Ala He Lys Ala Leu Lys Pro Ala Val Met Lys Lys 
250 ' 255 260 

ATA GCC AAG AGA TAG AAT GCC ATG AAG AAA CAC ATG AAG GCA CCT AAA 92 8 

lie Ala Lys Arg Gin Asn Ala Met Lys Lys His Met Lys Ala Pro Lys 
265 J 270 275 

ATT CCT AAC TCT ACC TTG GAA TCA AAG TAC TTG ACC TTC AAG GAT CTC 97 6 

lie Pro Asn Ser Thr Leu Glu Ser Lys Tyr Leu Thr Phe Lys Asp Leu 
230 285 290 295 

ATT AAG TTC TGC CAT ATT TCT GAG CCT AAA GAA AGA GTC TAT AAG ATC 102 4 

He Lys Phe Cys His He Ser Glu Pro Lys Glu Arg Val Tyr Lys He 
300 305 310 

CTT GGT AAA AAA TAC CCT AAG ACC GAA GAG GAA TAC AAA GCA GCC TTT 107 2 

Leu Gly Lys Lys Tyr Pro Lys Thr Glu Glu Glu Tyr Lys Ala Ala Phe 
315 320 325 

GGT GAT TCT GCA TCT GCA CCC TTC AAT CCT GAA TTG GCT GGA AAG CGT 1120 
Gly Asp Ser Ala Ser Ala Pro Phe Asn Pro Glu Leu Ala Gly Lys Arg ■ 
330 335 340 

ATG AAG ATT GAA ATC TCT AAA ACA TGG GAA AAT GAA CTC AGT GCA AAA 1168 
Met Lys lie Glu He Ser Lys Thr Trp Glu Asn Glu Leu Ser Ala Lys 
34 5 350 355 

GGC AAC ACT GCT GAG GTT TGG GAT AAT TTA ATT TCA AGC AAT TAA CTC 1216 
Gly Asn Thr Ala Glu Val Trp Asp Asn Leu He Ser Ser Asn Gin Leu 
360 365 370 375 

CCA TAT ATG GCC ATG TTA CGT AAC TTG TCT AAC ATC TTA AAA GCC GGT 12 64 

Pro Tyr Met Ala Met Leu Arg Asn Leu Ser Asn He Leu Lys Ala Gly 
380 335 390 

GTT TCA GAT ACT ACA CAC TCT ATT GTG ATC AAC AAG ATT TGT GAG CCC 1312 
Val Ser Asp Thr Thr His Ser lie Val lie Asn Lys lie Cys Glu Pro 
395 400 405 
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AAG GCC GTT GAG AAC 
Lys Ala Val Glu Asn 
410 

ATT GAA GCT GTT AAT 
He Glu Ala Val Asn 
425 

AG A GAA AAT ATG AAT 
Arg Glu Asn Met Asn 
440 

GTT GAA AAA ACC GAT 
Val Glu Lys Thr Asp 
460 

GAA GAA GGA GAA TTT 
Glu Glu Gly Glu Phe 
475 

ATT AAC TCC ATT GAA 
He Asn Ser He Glu 
490 

GAT GAA ATC AAA GGA 
Asp Glu He Lys Gly 

505 

ATG AGT ACC TCA ATG 
Met Ser Thr Ser Met 

520 

ACT TGT CTC GAG TGT 
Thr Cys Leu Glu Cys 
540 

TGT GAA PAG TCC TCA 
Cys Glu Lys Ser Ser 
555 

AAT AAG TGT TAC TTA 
Asn Lys Cys Tyr Leu 
570 

TCT ATG TM AAA CTT 
Ser Met Gin Lys Leu 

585 

GAT TTC CCC TAT GAG 
Asp Phe Pro Tyr Glu 
600 

GTA GAC AAT ATC GTT 
Val Asp Asn He Val 
620 

TCA GAT ATC AAT GTT 
Ser Asp He Asn Val 
635 



TCC AAG ATG TTC 
Ser Lys Met Phe 
415 

GAA GCA GTT ACT 
Glu Ala Val Thr 
430 

CTT AAA GGT CAA 
Leu Lys Gly Gin 
445 

GAA GAG AAG AAA 
Glu Glu Lys Lys 



GTT AAA GTC AAC 
Val Lys Val Asn 
480 

CTT GCA ATC AAG 
Leu Ala He Lys 
495 

CAC ACT GCA ATC 
His Thr Ala lie 
510 

TCA GGT GGA GCC 
Ser Gly Gly Ala 
525 

GCA TTA GTC CTT 
Ala Leu Val Leu 



TTC TAC ATC TTC 
Phe Tyr lie Phe 
560 

GAA GTT GAT CTC 
Glu Val Asp Leu 
575 

TTG CAA GAG AAA 
Leu Gin Glu Lys 
590 

TGC ATT GAT GAA 
Cys He Asp Glu 
605 

ATT TTG TCT GAT 
He Leu Ser Asp 

AGA GGC AGT TCC 
Arg Gly Ser Ser 
640 



CCT CTT CAA TTC TTT 
Pro Leu Gin Phe Phe 
420 

AAG GGA TTC AAG GCC 
Lys Gly Phe Lys Ala 
435 

ATC GAA GCA GTA AAG 
He Glu Ala Val Lys 
450 

GAT ATG GAG TTG GAG 
Asp Met Glu Leu Glu 
465 

GAA GGA ATT GGC AAG 
Glu Gly He Gly Lys 
485 

ATA GCA GTT AAC AAG 
He Ala Val Asn Lys 
500 

TTC TCT GAT GTT TCT 
Phe Ser Asp Val Ser 
515 

AAG AAG TAT GGT TCC 
Lys Lys Tyr Gly Ser 
530 

GGT TTG ATG GTA AAA 
Gly Leu Met Val Lys 
545 

AGT TCA CCT AGT TCT 
Ser Ser Pro Ser Ser 
565 

CCT GGA GAC GAA CTC 
Pro Gly Asp Glu Leu 
580 

GGA AAA CTT GGT GGT 
Gly Lys Leu Gly Gly 
595 

TGG ACA AAG AAT AAA 
Trp Thr Lys Asn Lys 
610 

ATG ATG ATT GCA GAA 
Met Met He Ala Glu 
625 

ATT GTT AAC AGC ATC 
He Val Asn Ser He 
645 



AGT GCC 13 60 

Ser Ala 

AAG AAG 14 08 

Lys Lys 

GAA GTT 14 56 

Glu Val 
455 

TAA ACC 1504 

Gin Thr 

470 

CAA TAC 1552 
Gin Tyr 

AAT TTA 1600 
Asn Leu 



GGT TCT 1648 
Gly Ser 



GTT CGT 1696 
Val Arg 
535 

TAA CGT 1744 

Gin Arg 

550 

CAA TGC 17 92 

Gin Cys 

CGT CCT 1840 
Arg Pro 

GGT ACT 1888 
Gly Thr 



ACT CAC 193 6 

Thr His 
615 

GGA TAT 198 4 

Gly Tyr 

630 

AAA AAG 2032 
Lys Lys 



148 



TAC AAG GAT GAA GTA AAT CCT AAC ATT AAA ATC TTT GCA GTT GAC TTA 2080 
Tyr Lys Asp Glu Val Asn Pro Asn lie Lys lie Phe Ala Val Asp Leu 
650 655 660 

GAA GGT TAC GGA AAG TGC CTT AAT CTA GGT GAT GAG TTC AAT GAA AAC 212 8 

Glu Gly Tyr Gly Lys Cys Leu Asn Leu Gly Asp Glu Phe Asn Glu Asn 
665 670 675 

AAC TAC ATC AAG ATA TTC GGT ATG AGC GAT TCA ATC TTA AAG TTC ATT 217 6 

Asn Tyr lie Lys lie Phe Gly Met Ser Asp Ser lie Leu Lys Phe lie 
680 685 690 695 

TCA GCC AAG CAA GGA GGA GCA AAT ATG GTC GAA GTT ATC AAA AAC TTT 2224 
Ser Ala Lys Gin Gly Gly Ala Asn Met Val Glu Val He Lys Asn Phe 
700 705 710 

GCC CTT CAA AAA ATA GGA CAA AAG TGAGTTTCTT GAGATTCTTC TATAACAAAA 227 8 
Ala Leu Gin Lys He Gly Gin Lys 
715 

ATCTCACCCC ACTTTTTTGT TTTATTGCAT AGCCATTATG AAATTTAAAT TATTATCTAT 2338 

TTATTTAAGT TACTTACATA GTTTATGTAT CGCAGTCTAT TAGCCTATTC AAATGATTCT 2398 

GCAAAGAACA AAAAAGATTA AAA 24 21 

(2) INFORMATION FOR SEQ ID MO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 719 amino acids 

(B) TYPE: amino acid 
{ D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(>:i) SEQUENCE DESCRIPTION: SEQ ID NO:219: 

Met Glu lie Glu Asn Asn Gin Ala Gin Gin Pro Lys Ala Glu Lys Leu 
1 5 10 15 

Trp Trp Glu Leu Glu Leu Glu Met Gin Glu Asn Gin Asn Asp He Gin 
20 25 30 

Val Arg Val Lys He Asp Asp Pro Lys Gin Tyr Leu Val Asn Val Thr 
35 40 45 

Ala Ala Cys Leu Leu Gin Glu Gly Ser Tyr Tyr Gin Asp Lys Asp Glu 
50 55 60 

Arg Arg Tyr lie lie Thr Lys Ala Leu Leu Glu Val Ala Glu Ser Asp 
65 70 75 * 80 

Pro Glu Phe He Cys Gin Leu Ala Val Tyr lie Arg Asn Glu Leu Tyr 
85 90 95 

lie Arg Thr Thr Thr Asn Tyr lie Val Ala Phe Cys Val Val His Lys 
100 105 110 



Asn Thr Gin 
115 



Pro Phe 



lie Glu Lys Tyr Phe Asn Lys Ala Val Leu Leu 
120 125 



149 



Pro Asn Asp Leu Leu Glu Val Cys Glu Phe Ala Gin Val Leu Tyr He 
130 135 140 

Phe Asp Ala Thr Glu Phe Lys Asn Leu Tyr Leu Asp Arg He Leu Ser 
145 150 155 160 

Gin Asp He Arg Lys Glu Leu Thr Phe Arg Lys Cys Leu Gin Arg Cys 
165 170 175 

Val Arg Ser Lys Phe Ser Glu Phe Asn Glu Tyr Gin Leu Gly Lys Tyr 
180 185 190 

Cys Thr Glu Ser Gin Arg Lys Lys Thr Met Phe Arg Tyr Leu Ser Val 
195 200 205 

Thr Asn Lys Gin Lys Trp Asp Gin Thr Lys Lys Lys Arg Lys Glu Asn 
210 215 220 

Leu Leu Thr Lys Leu Gin Ala He Lys Glu Ser Glu Asp Lys Ser Lys 
225 230 235 240 

Arg Glu Thr Gly Asp He Met Asn Val Glu Asp Ala He Lys Ala Leu 
245 250 255 

Lys Pro Ala Val Met Lys Lys He Ala Lys Arg Gin Asn Ala Met Lys 
260 265 270 

Lvs His Met: Lys Ala Pro Lys He Pro Asn Ser Thr Leu Glu Ser Lys 
275 280 285 

Tvr Leu Thr Phe Lys Asp Leu He Lys Phe Cys His He Ser Glu Pro 
290 295 300 

Lys Glu Arc Val Tyr Lys He Leu' Gly Lys Lys Tyr Pro Lys Thr Glu 
305 ' 310 315 320 

Glu Glu Tyr Lys Ala Ala Phe Gly Asp Ser Ala Ser Ala Pro Phe Asn 
325 330 335 

Pro Glu Leu Ala Gly Lys Arg Met Lys lie Glu He Ser Lys Thr Trp 
340 345 350 

Glu Asn Glu Leu Ser Ala Lys Gly Asn Thr Ala Glu Val Trp Asp Asn 
355 360 365 

Leu He Ser Ser Asn Gin Leu Pro Tyr Met Ala Met Leu Arg Asn Leu 
370 375 380 

S-r Asn He Leu Lys Ala Gly Val Ser Aso Thr Thr His Ser lie , Val 
385 390 395 400 

He Asn Lys lie Cys Glu Pro Lys Ala Val Glu Asn Ser Lys Met Phe 
405 410 415 

Pro Leu Gin Phe Phe Ser Ala He Glu Ala Val Asn Glu Ala Val Thr 
420 425 ° 430 



Lys Gly Phe Lys Ala Lys Lys Arg Glu Asn Met Asn Leu Lys Gly Gin 
435 440 445 



lie Glu Ala Val Lys Glu Val Val Glu Lys Thr Asp Glu Glu Lys Lys 
450 455 460 



Asp Met Glu Leu Glu Gin Thr Glu Glu Gly Glu Phe Val Lys Val Asn 
465 470 475 480 

Glu Gly He Gly Lys Gin Tyr He Asn Ser He Glu Leu Ala He Lys 
485 490 495 

He Ala Val Asn Lys Asn Leu Asp Glu He Lys Gly His Thr Ala He 
500 505 510 

Phe Ser Asp Val Ser Gly Ser Met Ser Thr Ser Met Ser Gly Gly Ala 
515 520 525 

Lys Lys Tyr Gly Ser Val Arg Thr Cys Leu Glu Cys Ala Leu Val Leu 
530 535 540 

Gly Leu Met Val Lys Gin Arg Cys Glu Lys Ser Ser Phe Tyr He Phe 
545 550 555 560 

Ser Ser Pro Ser Ser Gin Cys Asn Lys Cys Tyr Leu Glu Val Asp Leu 
565 570 575 

Pro Gly Asp Glu Leu Arg Pro Ser Met Gin Lys Leu Leu Gin Glu Lys 
580 585 590 

Gly Lys Leu Gly Gly Gly Thr Asp Phe Pro Tyr Glu Cys He Asp Glu 
595 600 605 

Trp Thr Lys Asn Lys Thr His Val Asp Asn He Val He Leu Ser Asp 
610 615 620 

Mec Met He Ala Glu Gly Tyr Ser Asp He Asn Val Arg Gly Ser Ser 

625 630' 635 640 

He Val Asn Ser He Lys Lys Tyr Lys Asp Glu Val Asn Pro Asn He 
645 650 655 

Lys lie Phe Ala Val Asp' Leu Glu Gly Tyr Gly Lys Cys Leu Asn Leu 
660 665 670 

Gly Asp Glu Phe Asn Glu Asn Asn Tyr lie Lys He Phe Gly Met Ser 
675 680 685 

Asp Ser He Leu Lys Phe lie Ser Ala Lys Gin Gly Gly Ala Asn Met 
690 695 700 

Val Glu Val He Lys Asn Phe Ala Leu Gin Lys He Gly Gin Lys 
705 710 715 



(2) INFORMATION FOR SEQ ID NO:220: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 2829 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 47.. 2665 

(D) OTHER INFORMATION: /codon= (seq: "taa", aa: Gin) 

/codon= (seq: "tag", aa: Gin) 
/codon= (seq: "tga", aa: TRM) 
/product= "Tetrahymena thermophila 
95 kDa protein subunit" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:220: 

TCAATACTAT TAATTAATAA ATAAAAAAAA GCAAACTACA AAGAAA ATG TCA AGG 55 

Met Ser Arg 
1 

CGT AAC TAA AAA AAG CCA TAG GCT CCT ATA GGC AAT GAA AC A AAT CTT 103 
Arg Asn Gin Lys Lys Pro Gin Ala Pro lie Gly Asn Glu Thr Asn Leu 
5 10 15 

GAT TTT GTA TTA CAA AAT CTA GAA GTT TAC AAA AGC CAG ATT GAG CAT 151 
Asp Phe Val Leu Gin Asn Leu Glu Val Tyr Lys Ser Gin He Glu His 
20 25 30 35 

TAT AAG ACC TAG TAG TAA TAG ATC AAA GAG GAG GAT CTC AAG CTT TTA 199 
Tyr Lys Thr Gin Gin Gin Gin He Lys Glu Glu Asp Leu Lys Leu Leu 
40 45 50 

AAG TTC AAA AAT TAA GAT TAG GAT GGA AAC TCT GGC AAC GAT GAT GAT 24 7 

Lys Phe Lys Asn Gin Asp Gin Asp Gly Asn Ser Gly Asn Asp Asp Asp 
55 60 65 

GAT GAA GAA AAC AAC TCA AAT AAA TAA TAA GAA TTA TTA AGG AGA GTC 2 95 

Asp Glu Glu Asn Asn Ser Asn Lys Gin Gin Glu Leu Leu Arg Arg Val 
70 75 80 

AAT TAG ATT .AAG TAG CAA GTT TAA TTG ATA AAA AAA GTT GGT TCT AAG 34 3 

Asn Gin He Lys Gin Gin Val Gin Leu He Lys Lys Val Gly Ser Lys 
85 . 90 95 

GTA GAG AAA GAT TTG AAT TTG AAC GAA GAT GAA AAC AAA AAG AAT GGA 391 
Val Glu Lys Asp Leu Asn Leu Asn Glu Asp Glu Asn Lys Lys Asn Gly 
100 105 110 " 115 

CTT TCT GAA TAG CAA GTG AAA GAA GAG TAA TTA AGA ACG ATT ACT GAA 4 39 

Leu Ser Glu Gin Gin Val Lys Glu Glu Gin Leu Arg Thr He Thr Glu 
120 125 130 

GAA TAG GTT AAG TAT TAA AAT TTA GTA TTT AAC ATG GAC TAC CAG TTA 4 87 

Glu Gin Val Lys Tyr Gin Asn Leu Val Phe Asn Met Asp Tyr Gin Leu 
135 140 ^ 145 

GAT TTA AAT GAG AGT GGT GGC CAT AGA AGA CAC AGA AGA GAA ACA GAT 535 
Asp Leu Asn Glu Ser Gly Gly His Arg Arg His Arg Arg Glu Thr Asp 
150 155 160 



TAT GAT ACT GAA AAA TGG TTT GAA ATA TCT CAT GAC CAA AAA AAT TAT 
Tyr Asp Thr Glu Lys Trp Phe Glu He Ser His Asp Gin Lys Asn Tyr 
165 170 "l75 



583 



152 



GTA TCA 
Val Ser 
180 



ATT TAC 
lie Tyr 



GCC AAC 
Ala Asn 
185 



TAA AAG 
Gin Lys 



ACA TCA TAT 
Thr Ser Tyr 
190 



TGT TGG 
Cys Trp 



TGG 
Trp 



CTT AAA 
Leu Lys 
195 



631 



GAT TAT 
Asp Tyr 



TTT AAT 
Phe Asn 



AAA AAC 
Lys Asn 
200 



AAT TAT 
Asn Tyr 



GAT CAT CTT 
Asp His Leu 
205 



AAT GTA 
Asn Val 



AGC 
Ser 



ATT AAC 
lie Asn 
210 



679 



AGA CTA 
Arg Leu 



GAA ACT 
Glu Thr 
215 



GAA GCC 
Glu Ala 



GAA TTC 
Glu Phe 



TAT GCC TTT 
Tyr Ala Phe 
220 



GAT GAT 
Asp Asp 



TTT 
Phe 
225 



TCA CAA 
Ser Gin 



727 



ACA ATC 
Thr He 



AAA CTT 
Lys Leu 
230 



ACT AAT 
Thr Asn 



AAT TCT 
Asn Ser 
235 



TAC TAG ACT 
Tyr Gin Thr 



GTT AAC 
Val Asn 
240 



ATA 
He 



GAC GTT 
Asp Val 



775 



AAT TTT 
Asn Phe 
245 



GAT AAT 
Asp Asn 



AAT CTC 
Asn Leu 



TGT ATA 
Cys He 
250 



CTC GCA TTG 
Leu Ala Leu 



CTT AGA 
Leu Arg 
255 



TTT TTA TTA 
Phe Leu Leu 



823 



TCA CTA 
Ser Leu 

260 



GAA AGA 
Glu Arg 



TTC AAT 
Phe Asn 
265 



ATT TTG 
lie Leu 



AAT ATA AGA 
Asn He Arg 
270 



TCT TCT 
Ser Ser 



TAT ACA AGA 
Tyr Thr Arg 
275 



871 



AAT TAA. 
Asn Gin 



TAT AAT 
Tyr Asn 



TTT GAG 
Phe Glu 
280 



AAA ATT 
Lys He 



GGT GAG CTA 
Gly Glu Leu 
285 



CTT GAA 
Leu Glu 



ACT ATC TTC 
Thr He Phe 
290 



919 



GCA GTT 
Ala Val 



GTC TTT 
Val Phe 
295 



TCT CAT 
Ser His 



CGC CAC 
Arg His 



TTA CAA GGC 
Leu Gin Gly 
300 



ATT CAT 
lie His 



TTA CAA GTT 
Leu Gin Val 
305 



967 



CCT TGC 
Pro Cys 



GAA GCG 
Glu Ala 
310 



TTC TAA 
Phe Gin 



TAT TTA 
Tyr Leu 
315 



GTT AAC TCC 
Val Asn Ser 



TCA TCA 
Ser Ser 
320 



TAA. ATT AGC 
Gin He Ser 



1015 



GTT AAA 
val Lys 
325 



GAT AGC 
Aso Ser 



TAA TTA 
Gin Leu 



TAG GTA 
Gin Val 
330 



TAC TCT TTC 
Tyr Ser Phe 



TCT ACA 
Ser Thr 
335 



GAC TTA AAA 
Asp Leu Lys 



1063 



TTA GTT 
Leu Val 
340 



GAC ACT 
Asp Thr 



AAC AAA 
Asn Lys 
345 



GTC CAA 
Val Gin 



GAT TAT TTT 
Asp Tyr Phe 
350 



AAG TTC 
Lys Phe 



TTA TAA GAA 
Leu Gin Glu 
355 



1111 



TTC CCT 
Phe Pro 



CGT TTG 
Arg Leu 



ACT CAT 
Thr His 
360 



GTA AGC 
Val Ser 



TAG TAG GCT 
Gin Gin Ala 
365 



ATC CCA 
He Pro 



GTT AGT GCT 
Val Ser Ala 
370 



1159 



ACT AAC 
Thr Asn 



GCT GTA 
Ala Val 
375 



GAG AAC 
Glu Asn 



CTC AAT 
Leu Asn 



GTT TTA CTT 
Val Leu Leu 
380 



AAA AAG 
Lys Lys 



GTC AAG CAT 
Val Lys His 
385 



1207 



GCT AAT 
Ala Asn 



CTT AAT 
Leu Asn 
390 



TTA GTT 
Leu Val 



TCT ATC 
Ser He 
395 



CCT ACC TAA 
Pro Thr Gin 



TTC AAT 
Phe Asn 
400 



TTT GAT TTC 
Phe Asp Phe 



1255 



TAC TTT 
Tyr Phe 
405 



GTT AAT 
Val Asn 



TTA TAA 
Leu Gin 



CAT TTG 
His Leu 
410 



AAA TTA GAG 
Lys Leu Glu 



TTT GGA 
Phe Gly 
415 



TTA GAA CCA 
Leu Glu Pro 



1303 



AAT ATT TTG ACA AAA CAA AAG CTT GAA AAT CTA CTT TTG AGT ATA AAA 1351 
Asn lie Leu Thr Lys Gin Lys Leu Glu Asn Leu Leu Leu Ser lie Lys 
420 425 430 435 

TAA TCA AAA AAT CTT AAA TTT TTA AGA TTA AAC TTT TAC ACC TAC GTT 1399' 
Gin Ser Lys Asn Leu Lys Phe Leu Arg Leu Asn Phe Tyr Thr Tyr Val 
440 445 450 

GCT TAA GAA ACC TCC AGA* AAA CAG ATA TTA AAA CAA GCT ACA ACA ATC 14 47 

Ala Gin Glu Thr Ser Arg Lys Gin lie Leu Lys Gin Ala Thr Thr lie 
455 460 465 

AAA AAT CTC AAA AAC AAT AAA AAT CAA GAA GAA ACT CCT GAA ACT AAA 14 95 

Lys Asn Leu Lys Asn Asn Lys Asn Gin Glu Glu Thr Pro Glu Thr Lys 
470 475 480 

GAT GAA ACT CCA AGC GAA AGC ACA AGT GGT ATG AAA TTT TTT GAT CAT 154 3 

Asp Glu Thr Pro Ser Glu Ser Thr Ser Gly Met Lys Phe Phe Asp His 
485 490 495 

CTT TCT GAA TTA ACC GAG CTT GAA GAT TTC AGC GTT AAC TTG TAA GCT 1591 
Leu Ser Glu Leu Thr Glu Leu Glu Asp Phe Ser Val Asn Leu Gin Ala 
500 505 510* 515 

ACC CAA GAA ATT TAT GAT AGC TTG CAC AAA CTT TTG ATT AGA TCA ACA 1639 
Thr Gin Glu He Tyr Asp Ser Leu His Lys Leu Leu He Arg Ser Thr 
520 525 530 

AAT TTA AAG AAG TTC AAA TTA AGT TAC AAA TAT GAA ATG GAA AAG AGT 1687 
Asn Leu Lys Lys Phe Lys Leu Ser Tyr Lys Tyr Glu Met Glu Lys Ser 
535 540 545 

AAA ATG GAT ACA TTC ATA GAT CTT AAG AAT ATT TAT GAA ACC TTA AAC 1735 
Lys Met Asp Thr Phe He Asp Leu Lys Asn He Tyr Glu Thr Leu Asn 
550 555 560 

AAT CTT AA-A AGA TGC TCT GTT PAT ATA TCA AAT CCT CAT GGA AAC ATT 17 8 3 

Asn Leu Lys Arg Cys Ser Val Asn He Ser Asn Pro His Gly Asn lie 
565 570 575 

TCT TAT GAA CTG ACA AAT AAA GAT TCT ACT TTT TAT AAA TTT AAG CTG 1831 
Ser Tyr Glu Leu Thr Asn Lys Asp Ser Thr Phe Tyr Lys Phe Lys Leu 
580 585 590 595 

ACC TTA AAC TAA GAA TTA TAA CAC GCT AAG TAT ACT TTT AAG TAG AAC 187 9 

Thr Leu Asn Gin Glu Leu Gin His Ala Lys Tyr Thr Phe Lys Gin Asn 
600 605 610 

GAA TTT TAA TTT AAT AAC GTT AAA AGT GCA AAA ATT GAA TCT TCC TCA 192 7 

Glu Phe Gin Phe Asn Asn Val Lys Ser Ala Lys He Glu Ser Ser Ser 
615 620 625 

TTA GAA AGC TTA GAA GAT ATT GAT AGT CTT TGC AAA TCT ATT GCT TCT 197 5 

Leu Glu Ser Leu Glu Asp lie Asp Ser Leu Cys Lys Ser lie Ala Ser 
630 635 640 

TGT AAA AAT TTA CAA AAT GTT AAT ATT ATC GCC AGT TTG CTC TAT CCC 2 02 3 

Cys Lys Asn Leu Gin Asn Val Asn He lie Ala Ser Leu Leu Tyr Pro 
645 650 655 



AAC AAT ATT TAG AAA AAT CCT TTC AAT .AAG CCC AAT CTT CTA TTT TTC 2071 
Asn Asn lie Gin Lys Asn Pro Phe Asn Lys Pro Asn Leu Leu Phe Phe 
660 665 670 675 

AAG CAA TTT GAA TAA TTG AAA AAT TTG GAA AAT GTA TCT ATC AAC TGT 2119 
Lys Gin Phe Glu Gin Leu Lys Asn Leu Glu Asn Val Ser lie Asn Cys 
680 685 690 

ATT CTT GAT CAG CAT ATA CTT AAT TCT ATT TCA GAA TTC TTA GAA AAG 2167 
lie Leu Asp Gin His lie Leu Asn Ser lie Ser Glu Phe Leu Glu Lys 
695 700 705 

AAT AAA AAA ATA AAA GCA TTC ATT TTG AAA AGA TAT TAT TTA TTA CAA 2215 
Asn Lys Lys lie Lys Ala Phe lie Leu Lys Arg Tyr Tyr Leu Leu Gin 
710 715 720 

TAT TAT CTT GAT TAT ACT AAA TTA TTT AAA ACA CTT CAA TAG TTA CCT 2263 
Tyr Tyr Leu Asp Tyr Thr Lys Leu Phe Lys Thr Leu Gin Gin Leu Pro 
• 725 730 . 735 

GAA TTA AAT TAA GTT TAC ATT AAT TAG CAA TTA GAA GAA TTG ACT GTG 2311 
Glu Leu Asn Gin Val Tyr He Asn Gin Gin Leu Glu Glu Leu Thr Val 
740 745 750 755 

AGT GAA GTA CAT AAG TAA GTA TGG GAA AAC CAC AAG CAA AAA GCT TTC 2 359 

Ser Glu Val His Lys Gin Val Trp Glu Asn His Lys Gin Lys Ala Phe 
760 765 770 

TAT GAA CCA TTA TGT GAG TTT ATC AAA GAA TCA TCC TAA ACC CTT TAG 24 07 

Tyr Glu Pro Leu Cys Glu Phe He Lys Glu Ser Ser Gin Thr Leu Gin 
775 780 735 

CTA ATA GAT TTT GAC CAA AAC ACT GTA AGT GAT GAC TCT ATT AAA AAG 24 55 

Leu lie Asp Phe Asp Gin Asn Thr Val Ser Asp Asp Ser He Lys Lys 
790 ' 795 800 

ATT TTA GAA TCT ATA TCT GAG TCT AAG TAT CAT CAT TAT TTG AGA TTG 2 503 

lie Leu Glu Ser He Ser Giu Ser Lys Tyr His His Tyr Leu Arg Leu 
805 810 815 

AAC CCT AGT TAA TCT AGC AGT TTA ATT AAA TCT GAA AAC GAA GAA. ATT 25 51 

Asn Pro Ser Gin Ser Ser Ser Leu He Lys Ser Glu Asn Glu Glu He 
820 825 830 835 

TAA GAA CTT CTC AAA GCT TGC GAC GAA AAA GGT GTT TTA GTA AAA GCA 2599 
Gin Glu Leu Leu Lys Ala Cys Asp Glu Lys Gly Val Leu Val Lys Ala 
840 845 850 

TAC TAT AAA TTC CCT CTA TGT TTA CCA ACT GGT ACT TAT TAC GAT TAC 264 7 

Tyr Tyr Lys Phe Pro Leu Cys Leu Pro Thr Gly Thr Tyr Tyr Asp Tyr 
855 860 '865 

AAT TCA GAT AGA TGG TGATTAATTA AATATTAGTT TAAATAAATA TTAAATATTG 2702 
Asn Ser Asp Arg Trp 
870 

AATATTTCTT TGCTTATTAT TTGAATAATA CATACAATAG TCATTTTTAG TGTTTTGAAT 27 62 
ATATTTTAGT TATTTAATTC ATTATTTTAA GTAAATAATT ATTTTTCAAT CATTTTTTAA 28 22 



AAAATCG 



2829 



155 



(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 872 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 

Met Ser Arg Arg Asn Gin Lys Lys Pro Gin Ala Pro lie Gly Asn Glu 
15 10 15 

Thr Asn Leu Asp Phe Val Leu Gin Asn Leu Glu Val Tyr Lys Ser Gin 
20 25 30 

He Glu His Tyr Lys Thr Gin Gin Gin Gin He Lys Glu Glu Asp Leu 
35 40 45 

Lys Leu Leu Lys Phe Lys Asn Gin Asp Gin Asp Gly Asn Ser Gly Asn 
50 55 60 

Aso Asd Asp Asp Glu Glu Asn Asn Ser Asn Lys Gin Gin Glu Leu Leu 
65 ' 70 75 80 

Arg Arg Val Asn Gin He Lys Gin Gin Val Gin Leu He Lys Lys Val 
85 90 95 

Gly Ser Lys Val Glu Lys Asp Leu Asn Leu Asn Glu Asp Glu Asn Lys 
100 105 110 

Lvs Asn Gly Leu Ser Glu Gin Gin Val Lys Glu Glu Gin Leu Arg Thr 
115 120 125 

lie Thr Glu Glu Gin Val Lys Tyr Gin Asn Leu Val Phe Asn Met Asp 
130 135 140 

Tvr Gin Leu Asp Leu Asn Glu Ser Gly Gly His Arg Arg His Arg Arg 
145 150 155 160 

Glu Thr Aso Tyr Asp Thr Glu Lys Trp Phe Glu He Ser His Asp Gin 
165 170 175 

Lys Asn Tyr Val Ser He Tyr Ala Asn Gin Lys Thr Ser Tyr Cys Trp 
180 185 190 

Trp Leu Lys Asp Tyr Phe Asn Lys Asn Asn Tyr Asp His Leu Asn Val 
195 200 205 

Ser He Asn Arg Leu Glu Thr Glu Ala Glu Phe Tyr Ala Phe Asp Asp 
210 215 220 

Phe Ser Gin Thr He Lys Leu Thr Asn Asn Ser Tyr Gin Thr Val Asn 
225 230 235 240 

He Asp Val Asn Phe Asp Asn Asn Leu Cys He Leu Ala Leu Leu Arg 
245 250 255 



Phe Leu Leu Ser Leu Glu Arg Phe Asn He Leu Asn He Arg Ser Ser 
260 265 270 



156 



Tyr Thr Arg Asn Gin Tyr Asn Phe Glu Lys He Gly Glu Leu Leu Glu 
275 280 285 

Thr He Phe Ala Val Val Phe Ser His Arg His Leu Gin Gly He His 
290 295 300 

Leu Gin Val Pro Cys Glu Ala Phe Gin Tyr Leu Val Asn Ser Ser Ser 
305 310 315 320 

Gin He Ser Val Lys Asp Ser Gin Leu Gin Val Tyr Ser Phe Ser Thr 
325 330 335 

Asp Leu Lys Leu Val Asp Thr Asn Lys . Val Gin Asp Tyr Phe Lys Phe 
340 345 350 

Leu Gin Glu Phe Pro Arg Leu Thr His Val Ser Gin Gin Ala He Pro- 
355 360 365 

Val Ser Ala Thr Asn Ala Val Glu Asn Leu Asn Val Leu Leu Lys Lys 
370 375 380 

Val Lys His Ala Asn Leu Asn Leu Val Ser He Pro Thr Gin Phe Asn 
385 390 395 400 

Phe Asp Phe Tyr Phe Val Asn Leu Gin His Leu Lys Leu Glu Phe Gly 
405 410 415 

Leu Glu Pro Asn He Leu Thr Lys Gin Lys Leu Glu Asn Leu Leu Leu 
420 425 430 

Ser lie Lys Gin Ser Lys Asn Leu Lys Phe Leu Arg Leu Asn Phe Tyr 
4 3 5 4 4 0 4 4 5 

Thr Tyr Val Ala Gin Glu Thr Ser Arg Lys Gin He Leu Lys Gin Ala 
450 455 460 

Thr Thr He Lys Asn Leu Lys Asn Asn Lys Asn Gin Glu Glu Thr Pro 
4 65 4 70 475 4 80 

Glu Thr Lys Asp Glu Thr Pro Ser Glu Ser Thr Ser Gly Met Lys Phe 
485 490 495 

Phe Asp His Leu Ser Glu Leu Thr Glu Leu Glu Asp Phe Ser Val Asn 
500 505 510 

Leu Gin Ala Thr Gin Glu He Tyr Asp Ser Leu His Lys Leu Leu He 
515 520 525 

Arg Ser Thr Asn Leu Lys Lys Phe Lys Leu Ser Tyr Lys Tyr Glu Met 
530 535 540 

Glu Lys Ser Lys Met Asp Thr Phe lie Asp Leu Lys Asn He Tyr Glu 
545 550 555 560 

Thr Leu Asn Asn Leu Lys Arg Cys Ser Val Asn lie Ser Asn Pro His 
565 570 575 

Gly Asn He Ser Tyr Glu Leu Thr Asn Lys Asp Ser Thr Phe Tyr Lys 
580 585 590 



157 



Phe Lys Leu Thr Leu Asn Gin Glu Leu Gin His Ala Lys Tyr Thr Phe 
595 600 605 

Lys Gin Asn Glu Phe Gin Phe Asn Asn Val Lys Ser Ala Lys He Glu 
610 615 620 

Ser Ser Ser Leu Glu Ser Leu Glu Asp He Asp Ser Leu Cys Lys Ser 
625 630 635 640 

He Ala Ser Cys Lys Asn Leu Gin Asn Val Asn He He Ala Ser Leu 
645 650 655 

Leu Tyr Pro Asn Asn He Gin Lys Asn Pro Phe Asn Lys Pro Asn Leu 
660 665 670 

Leu Phe Phe Lys Gin Phe Glu Gin Leu Lys Asn Leu Glu Asn Val Ser 
675 680 685 

He Asn Cys lie Leu Asp Gin His He Leu Asn Ser He Ser Glu Phe 
690 695 700 

Leu Glu Lys Asn Lys Lys He Lys Ala Phe He Leu Lys Arg Tyr Tyr 
705 710 715 720 

Leu Leu Gin Tyr Tyr Leu Asp Tvr Thr Lys Leu Phe Lys Thr Leu Gin 
725 ' 730 735 

Gin Leu Pro Glu Leu Asn Gin Val Tyr lie Asn Gin Gin Leu Glu Glu 
740 745 750 

Leu Thr Val Ser Glu Val His Lys Gin Val Trp Glu Asn His Lys Gin 

755 760 765 

Lvs Ala Phe Tyr Glu Pro Leu Cys Glu Phe He Lys Glu Ser Ser Gin 
770 775 730 

Thr Leu Gin Leu lie Aso Phe Aso Gin Asn Thr Val Ser Asp Asp Ser 
/ ,-. ,. 790 " 795 300 

He Lys Lys lie Leu Glu Ser He Ser Glu Ser Lys Tyr His His Tyr 
805 610 815 

Leu Arg Leu Asn Pro Ser Gin Ser Ser Ser Leu He Lys Ser Glu Asn 
320 825 830 

Glu Glu lie Gin Glu Leu Leu Lys Ala Cys Asp Glu Lys Gly Val Leu 
S35 840 845 

Val Lys Ala Tyr Tyr Lys Phe Pro Leu Cys Leu Pro Thr Gly Thr Tyr 
850 855 860 



Tvr: Asp Tyr Asn Ser Asp Arg Trp 
865 870 
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{2) INFORMATION FOR SEQ ID NO:222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 884 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE : 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..884 

(D) OTHER INFORMATION: /note= "Saccharomyces cerevisiae 

ESTp (L854 3.12)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:222: 

Met Lys He Leu Phe Glu Phe He Gin Asp Lys Leu Asp He Asp Leu 
15 10 15 

Gin Thr Asn Ser Thr Tyr Lys Glu Asn Leu Lys Cys Gly His Phe Asn 
20 25 30 

Gly Leu Asd Glu He Leu Thr Thr Cys Phe Ala Leu Pro Asn Ser Arg 
35^ 40 4 5 

Lys lie Ala Leu Pro Cys Leu Pro Gly Asp Leu Ser His Lys Ala Val 
50 55 60 

Il<= Asp His Cys lie lie Tyr Leu Leu Thr Gly Glu Leu Tyr Asn Asn 
65~ 70 "75 30 

Val i.ei' Thr- Phe Gly Tvr Lys lie Ala Arg Asn Glu Asp Val Asn Asn 
85 ' 90 95 

S^r Leu Phe Cys His Ser Ala Asn Val Asn Val Thr Leu Leu Lys Gly 
100 105 HO 

Ala Ala Trp Lys Met Phe His Ser Leu Val Gly Thr Tyr Ala Phe Val 
115 120 125 

Asp Leu Leu He Asn Tyr Thr Val lie Gin Phe Asn Giy Gin Phe Phe 
130 135 1^0 

Thr Gin lie Val Gly Asn Arg Cys Asn Glu Pro His Leu Pro Pro Lys 
145 150 155 160 

Trp Val Gin Arg Ser Ser Ser Ser Ser Ala Thr Ala Ala Gin He Lys 
165 170 175 

Gin Leu Thr Glu Pro Val Thr Asn Lys Gin Phe Leu His Lys Leu Asn 
180 135 190 

lie Asn Ser Ser Ser Phe Phe Pro Tyr Ser Lys lie Leu Pro Ser Ser 
195 200 205 

Ser Ser lie Lys Lys Leu Thr Asp Leu Arg Glu Ala lie Phe Pro Thr 
210 215 220 
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Asn Leu Val Lys lie Pro Gin. Arg Leu Lys Val Arg lie Asn Leu Thr 
225 230 235 240 

Leu Gin Lys Leu Leu Lys Arg His Lys Arg Leu Asn Tyr Val Ser He 
245 250 255 

Leu Asn Ser He Cys Pro Pro Leu Glu Gly Thr Val Leu Asp Leu Ser 
260 265 270 

His Leu Ser Arg Gin Ser Pro Lys Glu Arg Val Leu Lys Phe He He 
275 280 285 

Val He Leu Gin Lys Leu Leu Pro Gin Glu Met Phe Gly Ser Lys Lys 
290 295 300 

Asn Lys Gly Lys He He Lys Asn Leu Asn Leu Leu Leu Ser Leu Pro 
305 310 315 320 

Leu Asn Gly Tyr Leu Pro Phe Asp Ser Leu Leu Lys Lys Leu Arg Leu 
325 330 335 

Lys Asp Phe Arg Trp Leu Phe He Ser Asp He Trp Phe Thr Lys His 
340 345 350 

Asn Phe Glu Asn Leu Asn Gin Leu Ala He Cys Phe He Ser Trp Leu 
355 360 365 

Phe Arg Gin Leu He Pro Lys He He Gin Thr Phe Phe Tyr Cys Thr 
370 375 380 

Glu He Ser Ser Thr Val Thr He Val Tyr Phe Arg His Asp Thr Trp 
33 5 390 395 400 

Asn Lvs Leu He Thr Pro Phe He Val Glu Tyr Phe Lys Thr Tyr Leu 
4 05 410 415 

val GLu Asn Asn Val Cys Arg Asn His Asn Ser Tyr Thr Leu Ser Asn 
420 ' ' 425 4 30" 

Phe Asn His Ser Lys Met Arc lie lie Pro Lys Lys Ser Asn Asn Glu 
4 3 5 4 4 0 4 4 5 

Phe Arg lie He Ala He Pro Cys Arg Gly Ala Asp Glu Glu Glu Phe 
450 455 460 

Thr He Tyr Lys Glu Asn His Lys Asn Ala He Gin Pro Thr Gin Lys 
465 470 475 480 

He Leu Glu Tyr Leu Arg Asn Lys Arg Pro Thr Ser Phe Thr Lys He 
485 490 495 

Tyr Ser Pro Thr Gin He Ala Asp Arg lie Lys Glu Phe Lys Gin Arg 
500 505 510 

Leu Leu Lys Lys Phe Asn Asn Val Leu Pro Glu Leu Tyr Phe Met Lys 
515 520 525 

Phe Asp Val Lys Ser Cys Tyr Asp Ser He Pro Arg Met Glu Cys Met 
530 535 540 
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V 



Arg lie Leu Lys Asp Ala Leu Lys Asn Glu Asn Gly Phe Phe Val Arg 
545 550 555 560 

Ser Gin Tyr Phe Phe Asn Thr Asn Thr Gly Val Leu Lys Leu Phe Asn 
565 570 575 

Val Val Asn Ala Ser Arg Val Pro Lys Pro Tyr Glu Leu Tyr lie Asp 
580 585 590 

Asn Val Arg Thr Val His Leu Ser Asn Gin Asp Val lie Asn Val Val 

595 600 605 

Glu Met Glu lie Phe Lys Thr Ala Leu Trp Val Glu Asp Lys Cys Tyr 

610 615 620 

lie Arg Glu Asp Gly Leu Phe Gin Gly Ser Ser Leu Ser Ala Pro He. 
625 . 630 635 640 

Val Asp Leu Val Tyr Asp Asp Leu Leu Glu Phe Tyr Ser Glu Phe Lys 
645 650 655 

Ala Ser Pro Ser Gin Asp Thr Leu He Leu Lys Leu Ala Asp Asp Phe 
660 665 670 

Leu lie lie Ser Thr Asp Gin Gin Gin Val lie Asn He Lys Lys Leu 
675 630 635 

Ala Mec Gly Gly Phe Gin Lys Tyr Asn Ala Lys Ala Asn Arg Asp Lys 
690 695 700 

He Leu Ala Val Ser Ser Gin Ser Asp Asp Asp Thr Val He Gin Phe 

705 710 715 720 

Cys Ala Met His He Phe Val Lys Glu Leu Glu Val Trp Lys His Ser 
725 730 735 

Ser Thr Met Asn Asn Phe His He Arg Ser Lys Ser Ser Lys Gly He 
740 745. 750 

Phe Arg Ser Leu lie Ala Leu Phe Asn Thr Arg He Ser Tyr Lys Thr 
755 760 ' 765 

He Asp Thr Asn Leu Asn Ser Thr Asn Thr Val Leu Met Gin He Asp 
770 775 780 

His Val Val Lys Asn He Ser Glu Cys Tyr Lys Ser Ala Phe Lys Asp 
735 790 795 800 

Leu Ser He Asn Val Thr Gin Asn Met Gin Phe His Ser Phe Leu Gin 
805 810 815 

Arg He He Glu Met Thr Val Ser Giy Cys Pro He Thr Lys Cys Asp 
820 825 830 



Pro Leu He Glu Tyr Glu Val Arg Phe Thr lie Leu Asn Gly Phe Leu 
835 340 345 



Glu Ser Leu Ser Ser Asn Thr Ser Lys Phe Lys Asp Asn He He Leu 
350 855 " 860 
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Leu Arg Lys Glu He Gin His Leu Gin Ala Tyr He Tyr He Tyr He 
865 870 875 880 

His He Val Asn 



(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 223: 

Leu Cys Val Ser Tyr He Leu Ser Ser Phe Tyr Tyr Ala Asn Leu Glu 
15 10 15 

Glu Asn Ala Leu Gin Phe Leu Arg Lys Glu Ser Met Asp Pro Glu Lys 
20 25 30 

Pro Glu Thr Asn Leu Leu Met Arg Leu Thr 
35 40 



INFORMATION FOR SEQ ID MO:224: 

(i.) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 42 amino acids 
(3) TYPE: amino acid 
(CJ STRANDEDNESS : 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



SEQUENCE DESCRIPTION: SEQ ID NO: 224: 

Leu Cys Val Ser Ser He Leu Ser Ser Phe Tyr Tyr Ala Thr Leu Glu 
15 10 15 

Glu Ser Ser Leu Gly Phe Leu Arg Asp Glu Ser Met Asn Pro Giu Asn 
20 25 30 

Pro Asn Val Asn Leu Leu Met Arg Leu Thr 
35 40 



(2) INFORMATION FOR SEQ ID NO: 22 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . .2631 

(D) OTHER INFORMATION: /note= "Saccharomyces cerevisiae 

EST2" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:225: 

ATTTATACTC ATGAAAATCT TATTCGAGTT CATTCAAGAC AAGCTTGACA TTGATCTACA 60 

GACCAACAGT ACTTACAAAG AAAATTTAAA ATGTGGTCAC TTCAATGGCC TCGATGAAAT 120 

TCTAACTACG TGTTTCGCAC TACCAAATTC AAGAAAAATA GCATTACCAT GCCTTCCTGG 180 

TGACTTAAGC CACAAAGCAG TCATTGATCA CTGCATCATT TACCTGTTGA CGGGCGAATT 24 0 

ATACAACAAC GTACTAACAT TTGGCTATAA AATAGCTAGA AATGAAGATG TCAACAATAG 300 

TCTTTTTTGC CATTCTGCAA ATGTTAACGT TACGTTACTG AAAGGCGCTG CTTGGAAAAT 360 

GTTCCACAGT TTGGTCGGTA CATACGCATT CGTTGATTTA TTGATCAATT ATACAGTAAT 4 20 

TCAATTTAAT GGGCAGTTTT TCACTCAAAT CGTGGGTAAC AGATGTAACG AACCTCATCT 4 80 

GCCGCCCAAA TGGGTCCAAC GATCATCCTC ATCATCCGCA ACTGCTGCGC AAATCAAACA 54 0 

ACTTACAGAA CCAGTGACAA ATAAACAATT CTTACACAAG CTCAATATAA ATTCCTCTTC 600 

TTTTTTTCCT TATAGCAAGA TCCTTCCTTC ATCATCATCT ATCAAAAAGC TAACTGACTT 660 

GAGAGAA.GCT ATTTTTCCCA CAAATTTGGT TAAAATTCCT CAGAGACTAA AGGTACGAAT 720 

TAATTTGACG CTGCAAAAGC TATTAAAGAG ACATAAGCGT TTGAATTA.CG TTTCTATTTT 7 80 

GAATAGTATT TGCCCACCAT TGG.AAGGGAC CGTATTGGAC TTGTCGCATT TGAGTAGGCA 84 0 

ATCACCAAAG GAACGAGTCT TGAAATTTAT CATTGTTATT TTACAGAAGT TATTACCCCA 900 

AGAAATGTTT GGCTCAAAGA AAAATAAAGG AAAAATTATC AA G AA.T C T A A ATCTTTTATT 960 

AAGTTTACCC TTAAATGGCT ATTTACCATT TGATAGTTTG TTGAAAAAGT T AAG ATT AAA 1020 

GGATTTTCGG TGGTTGTTCA TTTCTGATAT TTGGTTCACC AAGCACAATT TTGAAAACTT 1080 

GAATCAATTG GCGATTTGTT TCATTTCCTG GCTATTTAGA CAACTAATTC CCAAAATTAT 1140 

ACAGACTTTT TTTTACTGCA CCGAAATATC TTCTACAGTG ACAATTGTTT ACTTTAGACA 1200 

TGATACTTGG AATAAACTTA TCACCCCTTT TATCGTAGAA TATTTTAAGA CGTACTTAGT 12 60 

CGAAAACAAC GTATGTAGAA ACCATAATAG TTACACGTTG TCCAATTTCA ATCATAGCAA 1320 

AATGAGGATT ATACCAAAAA AAAGTAATAA TGAGTTCAGG ATTATTGCCA TCCCATGCAG 138 0 

AGGGGCA.GAC GAAGAAGAAT TCACAATTTA TAAGGAGAAT CACAAAAATG CTATCCAGCC 14 40 

CACTCAAAAA ATTTTAGAAT AC CT AAG AAA CAAAAGGCCG ACTAGTTTTA CTAAAATATA 1500 

TTCTCCAACG CAAATAGCTG ACCGTATCAA AGAATTTAAG CAGAGACTTT TAAAGAAATT 1560 
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TAATAATGTC TTACCAGAGC TTTATTTCAT GAAATTTGAT GTCAAATCTT GCTATGATTC 1620 

CATACCAAGG ATGGAATGTA TGAGGATACT CAAGGATGCG CTAAAAAATG AAAATGGGTT 1680 

TTTCGTTAGA TCTCAATATT TCTTCAATAC CAATACAGGT GTATTGAAGT TATTTAATGT 1740 

TGTTAACGCT AGCAGAGTAC CAAAACCTTA TGAGCTATAC ATAGATAATG TGAGGACGGT 1800 

TCATTTATCA AATCAGGATG TTATAAACGT TGTAGAGATG GAAATATTTA AAACAGCTTT 18 60 

GTGGGTTGAA GATAAGTGCT ACATTAGAGA AGATGGTCTT TTTCAGGGCT CTAGTTTATC 1920 

TGCTCCGATC GTTGATTTGG TGTATGACGA TCTTCTGGAG TTTTATAGCG AGTTTAAAGC 1980 

CAGTCCTAGC CAGGACACAT TAATTTTAAA ACTGGCTGAC GATTTCCTTA TAATATCAAC 204 0 

AGACCAACAG CAAGTGATCA ATATCAAAAA GCTTGCCATG GGCGGATTTC AAAAATATAA 2100 

TGCGAAAGCC AATAGAGACA AAATTTTAGC CGTAAGCTCC " CAATCAGATG ATGATACGGT 2160 

TATTCAATTT TGTGCAATGC ACATATTTGT TAAAGAATTG GAAGTTTGGA AACATTCAAG 2220 

CACAATGAAT AATTTCCATA TCCGTTCGAA ATCTAGTAAA GGGATATTTC GAAGTTTAAT 2280 

AGCGCTGTTT AACACTAGAA TCTCTTATAA AACAATTGAC ACAAATTTAA ATTCAACAAA 234 0 

CACCGTTCTC ATGCAAATTG ATCATGTTGT AAAGAACATT TCGGAATGTT ATAAATCTGC 2 4 00 

TTTTAAGGAT CTATCAATTA ATGTTACGCA AAATATGCAA TTTCATTCGT TCTTACAACG 24 60 

CATCATTGAA ATGACAGTCA GCGGTTGTCC AATTACGAAA TGTGATCCTT TAATCGAGTA 2520 

TGAGGTACGA TTCACCATAT TGAATGGATT TTTGGAAAGC CTATCTTCAA A C AC AT C AAA 2 580 

ATTTAAAGAT AATATCATTC TTTTGAGAAA GGAAATTCAA CACTTGCAAG C 2 631 

{2} INFORMATION TOR SEQ ID NO: 226: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 
(3) TYPE: amino acid 
;C) ST RAND EDM ESS : 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:226: 
Phe Phe Tyr Cys Thr Glu He Ser Ser Thr .Val Thr lie Val Tyr Phe 



Arg His Asp Thr Trp Asn Lys Leu He Thr Pro Phe He Val Glu Tyr 
20 25 30 

Phe Lys Thr Tyr Leu Val Glu Asn Asn Val Cys Arg Asn His Asn Ser 



Tyr Thr Leu Ser Asn Phe Asn His Ser Lys Met Arg He He Pro Lys 
50 55 60 
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Lys Ser Asn Asn Glu Phe Arg lie 
65 70 

Asp Glu Glu Glu Phe Thr He Tyr 
85 

Gin Pro Thr Gin Lys He Leu Glu 
100 

Ser Phe Thr Lys He Tyr Ser Pro 

115 120 

Glu Phe Lys Gin Arg Leu Leu Lys 
130 135 

Leu Tyr Phe Met Lys Phe Asp Val 
145 . 150 



He Ala He Pro Cys Arg Gly Ala 
75 80 

Lys Glu Asn His Lys Asn Ala He 
90 95 

Tyr Leu Arg Asn Lys Arg Pro Thr 
105 110 

Thr Gin lie Ala Asp Arg He Lys 
125 

Lys Phe Asn Asn Val Leu Pro Glu 
140 

Lys Ser Cys Tyr Asp 
155 



(2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(:<i) SEQUENCE DESCRIPTION: SEQ ID NO:227: 

Phe Phe Tyr Val Thr Glu Gin Gin Lys Ser Tyr Ser Lys Thr Tyr Tyr 
I 5 10 15 

Tyr Arg Lys Asn He Trp Asp Val He Met Lys Met Ser lie Ala Asp 
20 25 30 

Leu Lys Lys Glu Thr Leu Ala Glu Val Gin Glu Lys Glu Val Glu Glu 
3 5 4 0 4 5 

Trp Lys Lys Ser Leu Gly Phe Ala Pro Gly Lys Leu Arg Leu He Pro 
50 55 60 

Lys Lys Thr Thr Phe Arg Pro He Met Thr Phe Asn Lys Lys He Val 
65 70 75 80 

Asn Ser Asp Arg Lys Thr Thr Lys Leu Thr Thr Asn Thr Lys Leu Leu 
85 90 95 

Asn Ser His Leu Met Leu Lys Thr Leu Lys Asn Arg Met Phe Lys Asp 
100 105 110 

Pro Phe Gly Phe Ala Val Phe Asn Tyr Asp Asp Val Met Lys Lys Tyr 
115 120 125 

Glu Glu Phe Val Cys Lys Trp Lys Gin Val Gly Gin Pro Lys Leu Phe 
130 135 140 

Phe Ala Thr Met Asp He Glu Lys Cys Tyr Asp 
145 150 155 
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(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 

Lys His Lys Glu Gly Ser Gin He Phe Tyr Tyr Arg Lys Pro He Trp 
15 10 15 

Lys Leu Val Ser Lys Leu Thr He Val Lys Val Arg He Gin Phe Ser 
20 25 30 

Glu Lys Asn Lys Gin Met Lys Asn Asn Phe Tyr Gin Lys He Gin Leu 
35 40 45 

Glu Glu Glu Asn Leu Glu Lys Val Glu Glu Lys Leu lie Pro Glu Asp 
50 55 60 

Ser Phe Gin Lys Tyr Pro Gin Gly Lys Leu Arg lie lie Pro Lys Lys 
65 70 75 80 

Gly Ser Phe Arg Pro He Met Thr Phe Leu Arg Lys Asp Lys Gin Lys 
85 90 95 

Asn He Lys Leu Asn Leu Asn Gin He Leu Met Asp Ser Gin Leu Val 
100 105 * 110 

Phe Arg Asn Leu Lys Asp Met Leu Gly Gin Lys lie Gly Tyr Ser Val 
115 * 120 125 

Phe Asp Asn Lys Gin lie Ser Glu Lys Phe Ala Gin Phe lie Glu Lys 
130 " 135 140 

Trp Lys Asn Lys Gly Arg Pro Gin Leu Tyr Tyr Val Thr Leu 
145 150- 155 



(2) INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:229: 



Tyr Tyr Tyr Arg Lys 
1 5 
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(2) INFORMATION FOR SEQ ID NO:230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 

Gly Lys Leu Arg lie lie Pro Lys Lys 
1 5 



(2) IN FORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

( B ) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 

Phe Arg Pro lie Met Thr Phe 

1. ' 5 



(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 

Phe Phe Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe 
15 10 15 

Tyr Arg Lys Ser Val Trp Ser Lys 
20 

(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:233: 

Arg Gin His Leu Lys Arg Val Gin Leu Arg Asp Val Ser Glu Ala Glu 
1 5 10 15 

Val Arg Gin His Arg Glu Ala 
20 



(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:234: 

Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg 
15 10 15 

Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu 
20 25 



(2) INFORMATION FOR SEQ ID NO: 2 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 23 amino acids 

(3) TYPE: amino acid 

(C) STRANDEDNESS : 

{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 

Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu 
1 ' 5 10 15 

Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gin 
20 25 



(2) INFORMATION FOR SEQ ID MO: 2 36: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 

Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gin Ser He Gly 
1 5 10 15 

He Arg Gin His Leu Lys Arg Val Gin Leu Arg Asp Val Ser 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:237: 

Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe He Pro Lys Pro Asp Gly 
I 5 10 15 

Leu Arg Pro He Val Asn Met Asp Tyr Val Val 
20 25 



(2) INFORMATION FOR SEQ ID NO: 2 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 
YARACHAARG GHAT YCCHYA RGG 



(2) INFORMATION FOR SEQ ID MO:239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:239: 



Gin Thr Lys Gly He Pro Gin Gly 
1 5 
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(2) INFORMATION FOR SEQ ID NO:240: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
{ B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:240:. 

Asp Asp Tyr Leu Leu lie Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:241: 
NGTNATDARD ARRTARTCRT C 



(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH : 55 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:242: 

Lys Gly He Pro Gin Gly Leu Cys Val Ser Ser He Leu Ser Ser Phe 
15 10 15 

Tyr Tvr Ala Thr Leu Glu Glu Ser Ser Leu Gly Phe Leu Arg Asp Glu 
20 25 30 

Ser Met Asn Pro Glu Asn Pro Asn Val Asn Leu Leu Met Arg Leu Thr 
35 40 45 



Asp Asp Tyr Leu Leu He Thr 
50^ 55 
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(2) INFORMATION FOR SEQ ID NO: 24 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:243: 

Ser He Leu Ser Ser Phe LeuCys His Phe Tyr Met Glu Asp Leu He 
1 5 10 15 

Asp Glu Tyr Leu Ser Phe Thr Lys Lys Lys Gly Ser Val Leu Leu Arg 
20 25 30 

Val Val 



;2j INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



;>:i) SEQUENCE DESCRIPTION: SEQ ID MO: 24 4: 

asp GW Leu Phe Gin Gly Ser Ser Leu Ser Ala Pro lie Val Asp Leu 
1 5 10 15 

Val Tyr Asp Asd Leu Leu Glu Phe Tyr Ser Glu Phe Lys Ala Ser Pro 
20* 25 30 

Ser Gin Asp Thr Leu He Leu Lys Leu Ala Asp Asp Phe Leu He lie 
35 40 45 

Ser 



(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 

Gin Lys Val Gly lie Pro Gin Gly 
1 ' 5 



(2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:246: 
CAAAAAGTTG GTATCCCTCA GGG 



(2) INFORMATION FOR SEQ ID NQ:247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii.) MOLECULE TYPE: DMA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 

AGACCAAAGG AATTCCATCA GGCTCAATTC TGTCATCTTT TTTGTGTCAT TTCTATATGG 60 

AAGAT TTGAT TGATGAATAC CTATCGTTTA CGAAAAAGAA AGGATCAGTG TTGTTACGAG 120 

TAGTCGACGA CTACCTCCTC ATCACC . 14 6 



(2) INFORMATION FOR SEQ ID MO:243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 

Lys Gly He Pro Ser Gly Ser He Leu Ser Ser Phe Leu Cys His Phe 
15 10 15 



Tyr Met Glu Asp Leu He Asp Glu Tyr Leu Ser Phe Thr Lys Lys Lys 
20 25 30 
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Gly Ser Val Leu Leu Arg Val Val Asp Asp Tyr Leu Leu He Thr 
35 40 45 



(2) INFORMATION FOR SEQ ID NO:249: 

(i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:249: 
GACGATTTCC TCTTTATAAC A 



(2) INFORMATION FOR SEQ ID NO: 250: 

{1} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:250: 

Aso Asp Phe Leu Phe He Thr 
I " 5 



INFORMATION FOR SEQ ID NO: 251: 

(l) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE : 

(A) NAME/KEY : Peptide 

(B) LOCATION: 1 . . 35 

(D) OTHER INFORMATION: /note= "motif 0 peptide from 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:251: 

Trp Leu Tyr Asn Ser Phe He He Pro He Leu Gin Ser Phe Phe Tyr 



Schizosaccharomyces pombe tezlp 



1 



5 



10 



15 
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lie Thr Glu Ser Ser Asp Leu Arg Asn Arg Thr Val Tyr Phe Arg Lys 
20 25 30 

Asp lie Trp 
35 



(2) EM FORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION: 1..22 

( D) OTHER INFORMATION : /note= "motif 1 and 2 peptide from 

Schi zosaccharomyces pombe tezlp" 



(:*:i) SEQUENCE DESCRIPTION: SEQ ID MO: 252: 

Ala Val lie Arg Leu Leu Pro Lys Lys Asn Thr Phe Arg Leu lie Thr 
15 10 15 

Asn Leu Arg Lys Arg Phe 
20 



^FORMATION FOR SEQ ID NO: 253: 

( L } SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS: 
;D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



;ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1. .26 

(D) OTHER INFORMATION: /note- "motif 3(A) peptide from 

Schizosaccharomyces pombe tezlp" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 

Lys Lys Tyr Phe Val Arg lie Asp lie Lys Ser Cys Tyr Asp Arg lie 
1 5 10 * 15 

Lys Gin Asp Leu Met Phe Arg lie Val Lys 
20 25 
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(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 32 

(D) OTHER INFORMATION: /note- "motif 4(B ! ) peptide from 

Schizosaccharomyces pombe tezlp" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 

Tyr Leu Gin Lys Val Gly He Pro Gin Gly Ser He Leu Ser Ser Phe 
15 10 15 

Leu Cys His Phe Tyr Met Glu Asp Leu He Asp Glu Tyr Leu Ser Phe 
20 25 30 



[2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



( ix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION: 1. . 4 9 

(D) OTHER INFORMATION: /note= "motif 5(C) and 6(D) peptide from 

Schizosaccharomyces pombe tezlp" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:255: 

Val Leu Leu Arg Val Val Asp Asp Phe Leu Phe He Thr Val Asn Lys 

1 5 10 15 

Lys Asp Ala Lys Lys Phe Leu Asn Leu Ser Leu Arg Gly Phe Glu Lys 
20 25 30, 

His Asn Phe Ser Thr Ser Leu Glu Lys Thr Val He Asn Phe Glu Asn 
35 40 45 



Ser 
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(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1...34 

(D) OTHER INFORMATION: /note= "motif 0 peptide from 

Saccharomyces cerevisiae EST2p" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:256: 

Trp Leu Phe Arg Gin Leu He Pro Lys He lie Gin Thr Phe Phe Tyr 
15 10 15 

Cys Thr Glu He Ser Ser Thr Val Thr He Val Tyr Phe Arg His Asp 
20 25 > 30 

Thr Trp 



(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

{ A ) LENGTH: 25 amino acids 
(3) TYPE: amino acid 
•:C) STRANDEDNESS : 
(j) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE : 

(A) NAME /KEY: Peptide 
(3) LOCATION: 1 . . 25 

(D) OTHER INFORMATION : /note= "motif 1 and 2 peptide from 

Saccharomyces cerevisiae EST2p" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 257: 

Ser Lvs Met Arg He lie Pro Lys Lys Ser Asn Asn Glu Phe Arg He 
1*5 10 15 



lie Ala He Pro Cys Arg Gly 'Ala Asp 
20 25 




(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..26 

(D) OTHER INFORMATION: /note= "motif 3(A) peptide from 

Saccharomyces cerevisiae EST2p" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 

Glu Leu Tyr Phe Met Lys Phe Asp Val Lys Ser Cys Tyr Asp Ser He 
1 5 10 .15 

Pro Arg Met Glu Cys Met Arg He Leu Lys 
20 25 



(2} INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE: CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY : Peptide 

(B) LOCATION: 1 . . 32 

(D) OTHER INFORMATION: /note= "motif 4(8') peptide from 

Saccharomyces cerevisiae EST2p M 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 

Tyr He Ara Glu Asp Gly Leu Phe Gin Gly Ser Ser Leu Ser Ala Pro 
1 , ' 5 L 10 15 

He Val Aso Leu Val Tyr Asp Asp Leu Leu Glu Phe Tyr Ser Glu Phe 
20 25 30 



(2) INFORMATION FOR SEQ ID NO:260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..49 

(D) OTHER INFORMATION: /note= "motif 5(C) peptide from 

Saccharomyces cerevisiae EST2p" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:260: 

Leu lie Leu Lys Leu Ala Asp Asp Phe Leu lie lie Ser Thr Asp ; Gin 
15 10 15 

Gin Gin Val He Asn He Lys Lys Leu Ala Met Gly Gly Phe Gin Lys 
20 25 30 

Tyr Asn Ala Lys Ala Asn Arg Asp Lys He Leu Ala Val Ser Ser Gin 
35 40 45 

Ser 



(2) INFORMATION FOR SEQ ID NO:261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



( ix) FEATURE: 

{ A ) MA M E/KEY : Peptide 
(3) LOCATION: 1 . . 35 

(D) OTHER INFORMATION: /note- "motif 0 peptide from 

Euplotes aediculatus p!23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 

Trp He Phe Glu Asp Leu Val Val Ser Leu He Arg Cys Phe Phe Tyr 
I 5 10 15 

Val Thr Glu Gin Gin Lys Ser Tyr Ser Lys Thr Tyr Tyr Tyr Arg Lys 
20 ' 25 30 

Asn lie Trp 
35 



(2) INFORMATION FOR SEQ ID NO: 262: 

(l) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..23 

(D) OTHER INFORMATION: /note= "motif 1 and 2 peptide from 

Euplotes aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 

Gly Lys Leu Arg Leu lie Pro Lys Lys Thr Thr Phe Arg Pro lie Met 
15 10 15 

Thr Phe Asn Lys Lys lie Val 
20 



(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



fix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1 . . 26 

(D) OTHER INFORMATION: /note= "motif 3(A) peptide fro: 

Euoiotes aediculatus ol2 3" 



(:-:i) SEQUENCE DESCRIPTION: SEQ ID MO:263: 

Lys Leu Phe Phe Ala Thr Met Asp lie Glu Lys Cys Tyr Asp Ser Val 
I 5 * 10 15 

Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys 
20 25 



{2} INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 32 

(D) OTHER INFORMATION: 7note= "motif 4(B') peptide from 

Euplotes aediculatus p!23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:264: 

Tyr Lys Gin Thr Lys Gly lie Pro Gin Gly Leu Cys Val Ser Ser lie 
15 10 15 

Leu Ser Ser Phe Tyr Tyr Ala Thr Leu Glu Glu Ser Ser Leu Gly Phe 
20 25 30 



(2) INFORMATION FOR SEQ ID NO:265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: peptide 



; ix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION: 1 . . 49 

(D) OTHER INFORMATION: /note= "motif 5(C) and 6(D) peptide from 

Euplotes aediculatus p!23" 



SEQUENCE DESCRIPTION: SEQ ID NO : 2 6 5 : 

Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu lie Thr Thr Gin Glu 

I 5 10 15 

Asn Asn Ala Val Leu Phe He Glu Lys Leu He Asn Val Ser Arg Glu 
20 25 30 

Asn Gly Phe Lys Phe Asn Men Lys Lys Leu Gin Thr Ser Phe Pro Leu 
35 40 45 

Ser 



(2) INFORMATION FOR SEQ ID NO: 266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 22. . 1716 

(D) OTHER INFORMATION: /note- "EcoRI-NotI insert of 

clone 712562 encoding 63 kDa 
telomerase protein" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 

GCCAAGTTCC TGCACTGGCT GATGAGTGTG TACGTCGTCG AGCTGCTCAG GTCTTTCTTT 60 

TATGTCACGG AGACCACGTT TCAAAAGAAC AGGCTCTTTT TCTACCGGAA GAGTGTCTGG 120 

AGCAAGTTGC AAAGCATTGG AATCAGACAG CACTTGAAGA GGGTGCAGCT GCGGGAGCTG 18 0 

TCGGAAGCAG AGGTCAGGCA GCATCGGGAA GCCAGGCCCG CCCTGCTGAC GTCCAGACTC 24 0 

CGCTTCATCC CCAAGCCTGA CGGGCTGCGG CCGATTGTGA ACATGGACTA CGTCGTGGGA 300 

GCCAGAACGT TCCGCAGAGA AAAGARGGCC GAGCGTCTCA CCTCGAGGGT GAAGGCACTG 360 

TTCAGCGTGC TCAACTACGA GCGGGCGCGG CGCCCCGGCC TCCTGGGCGC CTCTGTGCTG 4 20 

GGCCTGGACG AT AT CC AC AG GGCCTGGCGC ACCTTCGTGC TGCGTGTGCG GGCCCAGGAC 4 80 

CCGCCGCCTG AGCTGTACTT TGTCAAGGTG GATGTGACGG GCGCGTACGA CACCATCCCC 54 0 

CAGGACAGGC TCACGGAGGT CATCGCCAGC ATCATCAAAC CCCAGAACAC GTACTGCGTG 600 

CGTCGGTATG CCGTGGTCCA GAAGGCCGCC ATGGGCACGT CCGCAAGGCC TTCAAGAGCC 660 

AGGTCCTACG TCCAGTGCCA GGGGATCCCG CAGGGCTCCA TCCTCTCCAC GCTGCTCTGC 720 

AGGCTGTGCT ACGGCGACAT GGAGA-ACAAG CTGTTTGCGG GGATTCGGCG GGACGGGCTG 7 30 

C7CC7GCGTT TGGTGGATGA TTTCTTGTTG GTGACACCTC ACCTCACCCA CGCGAAAACC 84 0 

TTCCTCAGGA CCCTGGTCCG AGGTGTCCCT GAGTATGGCT GCGTGGTGAA CTTGCGGAAG 900 

ACAGTGGTGA ACTTCCCTGT AGAAGACGAG GCCCTGGGTG GCACGGCTTT TGTTCAGATG 960 

CCGGCCCACG GCCTATTCCC CTGGTGCGGC CTGCTGCTGG ATACCCGGAC CCTGGAGGTG 1020 

CAGAGCGACT ACTCCAGCTA TGCCCGGACC TCCATCAGAG CCAGTCTCAC CTTCAACCGC 108 0 

GGCTTCAAGG CTGGGAGGAA CATGCGTCGC AAACTCTTTG GGGTCTTGCG GCTGAAGTGT 1140 

CACAGCCTGT TTCTGGATTT GCAGGTGAAC AGCCTCCAGA CGGTGTGCAC CAACATCTAC 12 00 

AAGATCCTCC TGCTGCAGGC GTACAGGTTT CACGCATGTG TGCTGCAGCT CCCATTTCAT 12 60 

CAGCAAGTTT GGAAGAACCC ACATTTTTCC TGCGCGTCAT CTCTGACACG GCTCCCTCTG 1320 

CTACTCCATC CTGAAAGCCA AGAACGCAGG GATGTCGCTG GGGGCCAAGG GCGCCGCCGG 1380 

CCCTCTGCCC TCCGAGGCCG TGCAGTGGCT GTGCCACCAA GCATTCCTGC TCAAGCTGAC 14 4 0 

TCGACACCGT GTCACCTACG TGCCACTCCT GGGGTCACTC AGGACAGCCC AGACGCAGCT 1500 

GAGTCGGAAG CTCCCGGGGA CGACGCTGAC TGCCCTGGAG GCCGCAGCCA ACCCGGCACT 15 60 
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GCCCTCAGAC TTCAAGACCA TCCTGGACTG ATGGCCACCC GCCCACAGCC AGGCCGAGAG 1620 

CAGACACCAG CAGCCCTGTC ACGCCGGGCT TATACGTCCC AGGGAGGGAG GGGCGGCCCA 168 0 

CACCCAGGCC TGCACCGCTG GGAGTCTGAG GCCTGAGTGA GTGTTTGGCC GAGGCCTGCA 17 4 0 

TGTCCGGCTG AAGGCTGAGT GTCCGGCTGA GGCCTGAGCG AGTGTCCAGC CAAGGGCTGA 1800 

GTGTCCAGCA CACCTGCGTT TTCACTTCCC" CACAGGCTGG CGTTCGGTCC ACCCCAGGGC 18 60 

CAGCTTTTCC TCACCAGGAG CCCGGCTTCC ACTCCCCACA TAGGAATAGT CCATCCCCAG 1920 

ATTCGCCATT GTTCACCCTT CGCCCTGCCT TCCTTTGCCT TCCACCCCCA CCATTCAGGT 198 0 

GGAGACCCTG AGAAGGACCC TGGGAGCTTT GGGAATTTGG AGTGACCAAA GGTGTGCCCT 204 0 

GTACACAGGC GAGGACCCTG CACCTGGATG GGGGTCCCTG TGGGTCAAAT TGGGGGGAGG 2100 

TGCTGTGGGA GTAAAATACT GAATATATGA GTTTTTCAGT TTTGGAAAAA AAAAAAAAAA 2160 

AAAAAAAAAA A 2171 

(2) INFORMATION FOR SEQ ID NO: 267: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(A) NAME /KEY : Protein 

(B) LOCATION: 1 . . 564 

(D) OTHER INFORMATION: /note= "63 kDa telomerase protein 

encoded by ORE of EcoRI-NotI insert of 
clone 712562** 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 

Met Ser Val Tyr Vai Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr 
15 10 15 

Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val 
20 25 30 

Trp Ser Lys Leu Gin Ser lie Gly lie Arg Gin His Leu Lys Arg Val 
35 40 45 

Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin His Arg Glu Ala 
50 55 60 

Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe lie Pro Lys Pro Asp 

65 70 75 80 

Gly Leu Arg Pro He Val Asn Met Asp Tyr Val Val Gly Ala Arg Thr 
85 90 95 
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Phe Arg Arg Glu Lys Xaa Ala Glu Arg Leu Thr Ser Arg Val Lys Ala 
100 105 110 

Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu 
115 120 125 

Gly Ala Ser Val Leu Gly Leu Asp Asp lie His Arg Ala Trp Arg Thr 
130 135 140 

Phe Val Leu Arg Val Arg Ala Gin Asp Pro Pro Pro Glu Leu Tyr Phe 
145 150 155 160 

Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr lie Pro Gin Asp Arg 
165 170 175 

Leu Thr Glu Val He Ala Ser He He Lys Pro Gin Asn Thr Tyr Cys 
180 185 190 

Val Arg Arg Tyr Ala Val Val Gin Lys Ala Ala Met Gly Thr Ser Ala 
195 200 205 

Arg Pro Ser Arg Ala Thr Ser Tyr Val Gin Cys Gin Gly He Pro Gin 
210 215 220 

Gly Ser He Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met 
225 230 235 240 

Glu Asn Lys Leu Phe Ala Gly lie Arg Arg Asp Gly Leu Leu Leu Arg 
245 250 255 

Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr His Ala Lys 
260 265 270 

Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys Val 

275 280 235 

Val Asn Leu Arg Lys Thr Val Vai Asn Phe Pro Val Glu Asp Glu Ala 

290 295 300 

Leu Giy Gly Thr Ala Phe Val Gin Met Pro Ala His Gly Leu Phe Pro 
305 * 310 315 320 

Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gin Ser Asp 
325 330 335 

Tyr Ser Ser Tyr Ala Arg Thr Ser He Arg Ala Ser Leu Thr Phe Asn 
340 345 350 

Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly Val 
355 360 365 

Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gin Val Asn Ser 
370 375 380 

Leu Gin Thr Val Cys Thr Asn He Tyr Lys He Leu Leu Leu Gin Ala 
385 390 395 400 



Tyr Arg Phe His Ala Cys Val Leu Gin Leu Pro Phe His Gin Gin Val 
405 410 415 
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Trp Lys Asn Pro His Phe Ser Cys Ala Ser Ser Leu Thr Arg Leu Pro 
420 425 430 

Leu Leu Leu His Pro Glu Ser Gin Giu Arg Arg Asp Val Ala Gly Gly 
435 440 445 

Gin Gly Arg Arg Arg Pro Ser Ala Leu Arg Gly Arg Ala Val Ala Val 
450 455 460 

Pro Pro Ser lie Pro Ala Gin Ala Asp Ser Thr Pro Cys His Leu Arg 
465 470 475 480 

Ala Thr Pro Gly Val Thr Gin Asp Ser Pro Asp Ala Ala Glu Ser Glu 
485 490 495 

Ala Pro Gly Asp Asp Ala Asp Cys Pro Gly Gly Arg Ser Gin Pro Gly 
500 505 510 

Thr Ala Leu Arg Leu Gin Asp His Pro Gly Leu Met Ala Thr Arg Pro 
515 520 525 

Gin Pro Gly Arg Glu Gin Thr Pro Ala Ala Leu Ser Arg Arg Ala Tyr 
530 535 540 

Thr Ser Gin Gly Gly Arg Gly Gly Pro His Pro Gly Leu His Arg Trp 
545 550 555 560 

Giu Ser Glu Ala 



INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

■!3) TYPE: amino acid 

;C) STRANDEDNESS : 

( D } TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1 . . 22 

(D) OTHER INFORMATION: /note- "motif 1 peptide from 

Euolo~es aediculatus p!23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 

Leu Val Val Ser Leu lie Arg Cys Phe Phe Tyr Val Thr Glu Gin Gin 
15 10 15 

Lys Ser Tyr Ser Lys Thr 
20 
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(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) _ FEATURE : 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 30 

(D) OTHER INFORMATION: /note= "motif 0 peptide from 

Euplotes aediculatus p!23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:269: 

Lys Ser Leu Gly Phe Ala Pro Gly Lys Leu Arg Leu He Pro Lys Lys 
1 5 10 15 

Thr Thr Phe Arg Pro He Met Thr Phe Asn Lys Lys He Val 
20 25 30 



(2) INFORMATION FOR SEQ ID MO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

{11} MOLECULE TYPE: peptide 



( ix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION: 1 . . 27 

(D) OTHER INFORMATION: /note= "motif A peptide from 

Euplotes aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 270: 

Pro Lys Leu Phe Phe Ala Thr Met Asp He Glu Lys Cys Tyr Asp Ser 
15 10 15 

Val Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys 
20 25 



(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

( D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(ix) FEATURE : 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..27 

(D) OTHER INFORMATION: /note= "motif B peptide from 

Euplotes aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:271: 

Asn Gly Lys Phe Tyr Lys Gin Thr Lys Gly He Pro Gin Gly Leu Cys 
15 10 15 

Val Ser Ser He Leu Ser Ser Phe Tyr Tyr Ala 
20 25 



(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 
, (B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 22 

(D) OTHER INFORMATION: /note= "motif C peptide from 

Euplotes aediculatus pl23" 



(>:i) SEQUENCE DESCRIPTION: SEQ ID NO:272: 

Pro Asn Val Asn Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu He 
1 5 10 15 

i 

Thr Thr Gin Glu Asn Asn 
20 



(2) INFORMATION FOR SEQ ID NO:273: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: 



peptide 




(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. . 15 

(D) OTHER INFORMATION: /note= "motif D peptide from 

Eupiotes aediculatus p!23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:273: 

Asn Val Ser Arg Glu Asn Gly Phe Lys Phe Asn Met Lys Lys Leu 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Peptide 
(3) LOCATION: 1 . . 22 

(D) OTHER INFORMATION: /note= "motif 1 peptide from 

Schi zosaccharomyces pombe tezl" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:274: 

Phe lie lie Pro He Leu Gin Ser Phe Phe Tyr He Thr Glu Ser Ser 
15 10 15 

Asp Leu Arg Asn Arg Thr 
20 



(2) INFORMATION FOR SEQ ID MO:275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1 . . 30 

(D) OTHER INFORMATION: /note- "motif 0 peptide from 

Schi zosaccha romyces pombe tezl" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 

Gin Lys Thr Thr Leu Pro Pro Ala Val He Arg Leu Leu Pro Lys Lys 
15 10 15 

Asn Thr Phe Arg Leu He Thr Asn Leu Arg Lys Arg Phe Leu 
20 25 30 



(2) INFORMATION FOR SEQ ID NO:276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 27 

(D) OTHER INFORMATION: /note= "motif A peptide from 

Schizosaccharomyces pombe tez 1 " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:276: 

Arg Lys Lys Tyr Phe Val Arg He Asp lie Lys Ser Cys Tyr Asp Arg 
1 ' 5 10 15" 

lie Lys Gin Asp Leu Met Phe Arg He Val Lys 
20 25 



(2) INFORMATION FOR SEQ ID MO: 27 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 27 

(D) OTHER INFORMATION: /note- "motif B peptide from 

Schizosaccharomyces pombe tezl" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:277: 

Gly Asn Ser Gin Tyr Leu Gin Lys Val Gly He Pro Gin Gly Ser He 
1 5 10 15 



Leu Ser Ser Phe Leu Cys His Phe Tyr Met Glu 
20 25 
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(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 22 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

{A) NAME/KEY: Peptide 
(B) LOCATION: 1..22 

(D) OTHER INFORMATION: /note- "motif C peptide from 

Schi zosaccharomyces pombe tezl' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 

Lys Lys Gly Ser Val Leu Leu Arg Val Val Asp Asp Phe Leu Phe lie 
1 5 10 * 15 

Thr Val Asn Lys Lys Asp 
20 



!2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS: 

{A} LENGTH: 15 amino acids 
(5) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(Li} MOLECULE TYPE: peotide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1. . 15 

(D) OTHER INFORMATION: /note= "motif D peptide from 

Schizosaccharomyces pombe tezl' 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:279: 

Leu Asn Leu Ser Leu Arg Gly Phe Glu Lys His Asn Phe Ser Thr 
1 5 10 15 



(2) INFORMATION FOR SEQ ID MO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE : 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..22 

(D) OTHER INFORMATION: /note= "motif 1 peptide from 

Saccharomyces cerevisiae EST2" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:280: 

Leu lie Pro Lys lie lie Gin Thr Phe Phe Tyr Cys Thr Glu lie Ser 
1 5 10 15 

Ser Thr Val Thr He Val 
20 



(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION: 1 . . 32 

{ D) OTHER INFORMATION: /note= "motif 0 peptide from 

Saccharomyces cerevisiae EST2" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 

Thr Leu Ser Asn Phe Asn His Ser Lys Met Arg lie lie Pro Lys Lys 
15 10 15 

Ser Asn Asn Glu Phe Arg He He Ala He Pro Cys Arg Gly AH Asp 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .27 

(D) OTHER INFORMATION: /note= "motif A peptide from 

Saccharomyces cerevisiae EST2 " 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:282: 

Pro Glu Leu Tyr Phe Met Lys Phe Asp Val Lys Ser Cys Tyr Asp Ser 
15 10 15 

lie Pro Arg Met Glu Cys Met Arg lie Leu Lys 
20 25 



(2) INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..27 

(D) OTHER INFORMATION: /note= "motif B peptide from 

Saccharomyces cerevisiae EST2' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:283: 

Glu Asp Lys Cys Tyr lie Arg Glu Asp Gly Leu Phe Gin Gly Ser Ser 
1 5 ^ " 10 15 

Leu Ser Ala Pro He Val Asp Leu Val Tyr Asp 
20 25 



I N FORMAT ION FOR SEQ ID MO : 2 8 4 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 22 

(D) OTHER INFORMATION: /note= "motif C peptide from 

Saccharomyces cerevisiae EST2' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284: 

Ser Gin Asp Thr Leu He Leu Lys Leu Ala Asp Asp Phe Leu He He 
15 10 15 



Ser Thr Asp Gin Gin Gin 
20 
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(2) INFORMATION FOR SEQ ID NO:285: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..15 

(D) OTHER INFORMATION: /note= "motif D peptide from 

Saccharomyces cerevisiae EST2 " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:285: 

Lys Lys Leu Ala Met Gly Gly Phe Gin Lys Tyr Asn Ala Lys Ala 
1 5 10 15 



{2} INFORMATION FOR SEQ ID NO: 286: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) ■ TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 
(3) LOCATION: 1 . . 22 

(D) OTHER INFORMATION: /note= "motif 1 peptide from human 

telor.erase core protein 1 (TCP!)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 

Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr 
15 10 15 

Phe Gin Lys Asn Arg Leu 
20 



(2) INFORMATION FOR SEQ ID NO: 287: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:. 30 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 30 

(D) OTHER INFORMATION: /note= "motif 0 peptide from human 

telomerase core protein 1 (TCP1)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 

Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe He Pro Lys Pro 
15 10 15 

Asp Gly Leu Arg Pro He Val Asn Met Asp Tyr Val Val Gly 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..27 

(D) OTHER INFORMATION: /noee= "motif A peptide from human 

telo~erase core protein 1 (TC?1)" 



(>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 

Pro Glu Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr 
1 5 10 15' 

lie Pro Gin Asp Arg Leu Thr Glu Val He Ala 
20 25 



(2) INFORMATION FOR SEQ ID NO: 23 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Protein 

(B) LOCATION: 1..27 

(D) OTHER INFORMATION: /note- "motif B peptide from human 

telomerase core protein 1 (TCP1)" 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 

Arg Ala Thr Ser Tyr Val Gin Cys Gin Gly He Pro Gin Gly Ser He 
15 10 15 

Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly 
20 25 



(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..22 

(D) OTHER INFORMATION: /note* "motif C peptide from human 

telomerase core protein 1 (TCP1)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

Arg Arg Asp Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val 
1 5 10 ' ' 15 

Thr Pro His Leu Thr His 
20 



(2) I N FORMATION FOR SEQ ID MO: 291: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 15 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE : 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..15 

(D) OTHER INFORMATION: /note- "motif D peptide from human 

telomerase core protein 1 {TCP!}" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 

Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys Val Val 
15 10 15 
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(2) INFORMATION FOR SEQ ID NO: 292: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4029 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..4029 

(D) OTHER INFORMATION: /note- "preliminary sequence for 

human TRT cDNA insert of 
plasmid pGRN121" 

(xi) sequence description: seq id no:292: 

gcagcgctgc gtcctgctgc gcacgtggga agccctggcc ccggccaccc ccgcgatgcc 60 

gcgcgctccc cgctgccgag ccgtgcgctc cctgctgcgc agccactacc gcgaggtgct 120 

gccgctggcc acgttcgtgc ggcgcctggg gccccagggc tggcggctgg tgcagcgcgg 180 

ggacccggcg gctttccgcg cgntggtggc ccamtgcntg gtgtgcgtgc cctgggangn 24 0 

amggcmgccc cccgccgccc cctccttccg ccaggtgtcc tgcctgaamg anctggtggc 300 

ccgagtgctg camangctgt gcgamcgcgg cgcgaamaac gtgctggcct tcggcttcgc 360 

gctgctggac ggggcccgcg ggggcccccc cgaggccttc accaccagcg tgcgcagcta 4 20 

cgtgcccaac acggtgaccg acgcactgcg ggggagcggg gcgtgggggc tgctgctgcg 430 

cggggtgggc gacgacgtgc tggttcacct gctggcacgc tgcgcgntmt ttgtgctggt 54 0 

cgmtcccagc tgcgcctacc angtgtgcgg gccgccgctg taccagctcg gcgctgcnac 600 

tcaggcccgg cccccgccac acgctamtgg acccgaangc gtctgggatc caacgggcct 660 

ggaacgatag cgtcagggag gccggggtcc ccctgggctg ccagccccgg gtgcgaggag 720 

gcgcgggggc agtgccagcc gaagtctgcc gttgcccaag aggcccaggc gtggcgctgc 780 

ccctgagccg gagcggacgc ccgttgggca ggggtcctgg gcccacccgg gcaggacgcc 84 0 

tggaccgagt gaccgtggtt tctgtgtggt gtcacctgcc agacccgccg aagaagccac 900 

ctctttggag ggtgcgctct ctggcacgcg ccactcccac ccatccgtgg gccgccagca 960 

ccacgcgggc cccccatcca catcgcggcc accacgtcct gggacacgcc ttgtcccccg 1020 

gtgtacgccg agaccaagca cttcctctac tcctcaggcg acaagmacac tgcgnccctc 108 0 

cttcctactc aatatatctg aggcccagcc tgactggcgt tcgggaggtt cgtggagaca 114 0 

mtctttctgg ttccaggcct tggatgccag gattccccgc aggttgcccc gcctgcccca 1200 
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GCGNTACTGG CAAATGCGGC CCCTGTTTCT GGAGCTGCTT GGGAACCACG CGCAGTGCCC 12 60 

CTACGGGGTG TTCCTCAAGA CGCACTGCCC GCTGCGAGCT GCGGTCACCC CAGCAGCCGG 132 0 

TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC TGTGGCGGCC CCCGAGGAGG AGGAACACAG 1380 

ACCCCCGTCG CCTGGTGCAG CTGCTCCGCC AGCACAGCAG CCCCTGGCAG GTGTACGGCT 14 4 0 

TCGTGCGGGC CTGCCTGCGC CGGCTGGTGC CCCCAGGCCT CTGGGGCTCC AGGCACAACG 1500 

AACGCCGCTT CCTCAGGAAC ACCAAGAAGT TCATCTCCCT GGGGAAGCAT GCCAAGCTCT 15 60 

CGCTGCAGGA GCTGACGTGG AAGATGAGCG TGCGGGACTG CGCTTGGCTG CGCAGGAGCC 1620 

CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC ACCGTCTGCG TGAGGAGATC CTGGCCAAGT 1680 

TCCTGCACTG GCTGATGAGT GTGTACGTCG TCGAGCTGCT CAGGTCTTTC TTTTATGTCA 174 0 

CGGAGACCAC GTTTCAAAAG AACAGGCTCT TTTTCTACCG GAAGAGTGTC TGGAGCAAGT 1800 

TGCAAAGCAT TGGAATCAGA CAGCACTTGA AGAGGGTGCA GCTGCGGGAG CTGTCGGAAG 18 60 

CAGAGGTCAG GCAGCATCGG GAAGCCAGGC CCGCCCTGCT GACGTCCAGA CTCCGCTTCA 1920 

TCCCCAAGCC TGACGGGCTG CGGCCGATTG TGAACATGGA CTACGTCGTG GGAGCCAGAA 198 0 

CGTTCCGCAG AGAAAAGAGG GCCGAGCGTC TCACCTCGAG GGTGAAGGCA CTGTTCAGCG 204 0 

TGCTCAACTA CGAGCGGGCG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 2100 

ACGATATCCA CAGGGCCTGG CGCACCTTCG TGCTGCGTGT GCGGGCCCAG GACCCGCCGC 2160 

CTGAGCTGTA CTTTGTCAAG GTGGATGTGA CGGGCGCGTA CGACACCATC CCCCAGGACA 222 0 

GGCTCACGCA GGTCATCGCC AGCATCATCA AACCCCAGAA CACGTACTGC GTGCGTCGGT 228 0 

ATGCCGTGGT CCAGAAGGCC GCCCATGGGC ACGTCCGCAA GGCCTTCAAG AGGCACGTCT 2 34 0 

CTACCTTGAC AGACCTCCAG CCGTACATGC GACAGTTCGT GGCTCACCTG CAGGAMAACA 2 4 00 

GCCCGCTGAG GGATGCCGTC GTCATCGAGC AGAGCTCCTC CCTGAATGAG GCCAGCAGTG 24 60 

GCCTCTTCGA CGTCTTCCTA CGCTTCATGT GCCACCACGC CGTGCGCATC AGGGGCAAGT 2 520 

CCTACGTCCA GTGCCAGGGG ATCCCGCAGG GCTCCATCCT CTCCACGCTG CTCTGCAGCC 2 58 0 

TGTGCTACGG CGACATGGAG AACAAGCTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC 2 64 0 

TGCGTTTGGT GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC 2700 

TCAGGACCCT GGTCCGAGGT GTCCCTGAGT ATGGCTGCGT GGTGAACTTG CGGAAGACAG 27 60 

TGGTGAACTT CCCTGTAGAA GACGAGGCCC TGGGTGGCAC GGCTTTTGTT CAGATGCCGG 2820 

CCCACGGCCT ATTCCCCTGG TGCGGCCTGC TGCTGGATAC CCGGACCCTG GAGGTGCAGA 2 8 80 

GCGACTACTC CAGCTATGCC CGGACCTCCA TCAGAGCCAG TCTCACCTTC AACCGCGGCT 2 94 0 

TCAAGGCTGG GAGGAACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTCACA 3000 

GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCCAGACGGT GTGCAGCAAC ATCTACAAGA 30 60 
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TCCTCCTGCT GCAGGCGTAC AGGTTTCACG CATGTGTGCT GCAGCTCCCA TTTCATCAGC 3120 

AAGTTTGGAA GAACCCCACA TTTTTCCTGC GCGTCATCTC TGACACGGCC TCCCTCTGCT 3180 

ACTCCATCCT GAAAGCCAAG AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC 32 4 0 

CTCTGCCCTC CGAGGCCGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC 3300 

GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GACAGCCCAG ACGCAGCTGA 3360 

GTCGGAAGCT CCCGGGGACG ACGCTGACTG CCCTGGAGGC CGCAGCCAAC CCGGCACTGC 34 20 

CCTCAGACTT CAAGACCATC CTGGACTGAT GGCCACCCGC CCACAGCCAG GCCGAGAGCA 34 80 

GACACCAGCA GCCCTGTCAC GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCACAC 354 0 

CCAGGCCCGC ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 3600 

CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA GGGCTGAGTG 3660 

TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG CTCGGCTCCA CCCCAGGGCC 3720 

AGCTTTTCCT CACCAGGAGC CCGGCTTCCA CTCCCCACAT AGGAATAGTC CATCCCCAGA 37 80 

TTCGCCATTG TTCACCCCTC GCCCTGCCCT CCTTTGCCTT CCACCCCCAC CATCCAGGTG 38 4 0 

GAGACCCTGA GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGACCAAAG GTGTGCCCTG 3900 

TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGTCAAATT GGGGGGAGGT 3960 

GCTGTGGGAG TAAAATACTG AATATATGAG TTTTTCAGTT TTGAAAAAAA AAAAAAAAAA 4 020 

AAAAAAAAA 4 02 9 

!2) INFORMATION FOR SEQ ID MO: 293: 

(i) SEQUENCE CHARACTERISTICS: 

{ A) LENGTH: 261 amino acids 
(3) TYPE: amino acid 

(C) ST HANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 

Ala Ala Leu Arg Pro Ala Ala His Val Gly Ser Pro Gly Pro Gly His 
1 5 10 ' 15 

Pro Arg Asp Ala Ala Arg Ser Pro Leu Pro Ser Arg Ala Leu Pro Ala 
20 25 30 

Ala Gin Pro Leu Pro Arg Gly Ala Ala Ala Gly His Val Arg Ala Ala 
35 40 45 



Pro Gly Ala Pro Gly Leu Ala Ala Gly Ala Ala Arg Gly Pro Gly Gly 
50 55 60 
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Phe Pro Arg Xaa Gly Gly Pro Xaa Xaa Gly Val Arg Ala Leu Gly Xaa 
65 70 75 80 

Xaa Ala Ala Pro Arg Arg Pro Leu Leu Pro Pro Gly Val Leu Pro Glu 
85 90 95 

Xaa Xaa Gly Gly Pro Ser Ala Ala Xaa Ala Val Arg Xaa Arg Arg Glu 
100 105 110 

Xaa Arg Ala Gly Leu Arg Leu Arg Ala Ala Gly Arg Gly Pro Arg Gly 
115 120 125 

Pro Pro Arg Gly Leu His His Gin Arg Ala Gin Leu Pro Ala Gin His 
130 135 140 

Gly Asp Arg Arg Thr Ala Gly Glu Arg Gly Val Gly Ala Ala Ala Ala 
145 , 150 155 160 

Pro Arg Gly Arg Arg Arg Ala Gly Ser Pro Ala Gly Thr Leu Arg Xaa 
165 170 175 

Xaa Cys Ala Gly Gly Ser Gin Leu Arg Leu Pro Xaa Val Arg Ala Ala 
180 185 190 

Ala Val Pro Ala Arg Arg Cys Xaa Ser Gly Pro Ala Pro Ala Thr Arg 
195 200 205 

Xaa Trp Thr Arg Xaa Arg Leu Gly Ser Asn Gly Pro Gly Thr lie Ala 
210 215 220 

Ser Gly Arg Pro Gly Ser Pro Trp Ala Ala Ser Pro Gly Cys Glu Glu 

225 230 235 240 

Ala Arg Gly Gin Cys Gin Pro Lys Ser Ala Vai Ala Gin Glu Ala Gin 
245 250 255 



Ala Trp Arg Cys Pro 
260 



(2) INFORMATION FOR SEQ ID MO:294: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 

Ala Gly Ala Asp Ala Arg Trp Ala Gly Val Leu Gly Pro Pro Gly Gin 
15 10 15 



Asp Ala Trp Thr Glu 
20 



198 



(2) INFORMATION FOR SEQ ID NO: 295: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:295: 

Pro Trp Phe Leu Cys Gly Val Thr Cys Gin Thr Arg Arg Arg Ser His 
15 10 15 

Leu Phe Gly Gly Cys Ala Leu Trp His Ala Pro Leu Pro Pro lie Arg 
20 25 30 

Gly Pro Pro Ala Pro Arg Gly Pro Pro He His He Ala Ala Thr Thr 
35 40 45 

Ser Trp Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe 
50 55 60 

Leu Tyr Ser Ser Gly Asp Lys Xaa Thr Ala Xaa Leu Leu Pro Thr Gin 
65 70 75 80 

Tyr He 



(2) INFORMATION FOR SEQ ID NO: 2 96: 

(i) SEQUENCE CHARACTERISTICS: 

{ A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRI PTION : ■ SEQ ID NO:296: 



Leu Ala Phe Gly Arg Phe Val Glu 
1 5 

Gly Cys Gin Asp Ser Pro Gin Val 
20 

Ala Asn Ala Ala Pro Val Ser Gly 
35 40 

Pro Leu Arg Gly Val Pro Gin Asp 
50 55 

His Pro Ser Ser Arg Cys Leu Cys 
65 70 



Thr Xaa Phe Leu Val Pro Gly Leu 
10 15 

Ala Pro Pro Ala Pro Ala Xaa Leu 
25 30 

Ala Ala Trp Glu Pro Arg Ala Val 
45 

Ala Leu Pro Ala Ala Ser Cys Gly 
60 

Pro Gly Glu Ala Pro Gly Leu Cys 
75 80 



Gly Gly 



Pro Arg Gly Gly 
85 



Gly Thr 



Gin Thr Pro Val Ala Trp Cys Ser 
90 95 
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Cys Ser Ala Ser Thr Ala Ala Pro Gly Arg Cys Thr Ala Ser Cys Gly 
100 105 110 

Pro Ala Cys Ala Gly Trp Cys Pro Gin Ala Ser Gly Ala Pro Gly Thr 
115 120 125 

Thr Asn Ala Ala Ser Ser Gly .Thr Pro Arg Ser Ser Ser Pro Trp Gly 
130 135 140 

Ser Met Pro Ser Ser Arg Cys Arg Ser 
145 150 



(2) INFORMATION FOR SEQ ID NO: 297: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 

Ala Cys Gly Thr Ala Leu Gly Cys Ala Gly Ala Gin Gly Leu Ala Val 
15 10 15 

Phe Arg Pro Gin Ser Thr Val Cys Val Arg Arg Ser Trp Pro Ser Ser 
20 25 30 

Cys Thr Gly 
35 



(2) INFORMATION FOR SEQ ID NO: 293: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 43 ammo acids 
■(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:298: 

Val Cys Thr Ser Ser Ser Cys Ser Gly Leu Ser Phe Met Ser Arg Arg 
1 5 10 15 

Pro Arg Phe Lys Arg Thr Gly Ser Phe Ser Thr Gly Arg Val Ser Gly 
20 25 30 

Ala Ser Cys Lys Ala Leu Glu Ser Asp Ser Thr 
35 40 
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(2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:299: 

Arg Gly Cys Ser Cys Gly Ser Cys Arg Lys Gin Arg Ser Gly Ser lie 
15 10 15 

Gly Lys Pro Gly Pro Pro Cys 



(2} INFORMATION FOR SEQ ID NO:300: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: ^16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



SEQUENCE DESCRIPTION: SEQ ID MO: 300: 

Arg Pro Asp Ser Ala Ser Ser Pro Ser Leu Thr Gly Cys Gly Arg Leu 
1^5 10 15 



(2} INFORMATION FOR SEQ ID NO: 301: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(:-:i) SEQUENCE DESCRIPTION: SEQ ID MO: 301: 

Thr Trp Thr Thr Ser Trp GIu Pro Glu Arg Ser Ala Glu Lys Arg Gly 



20 



1 



5 



10 



15 



Pro Ser Val Ser Pro Arg Gly 
20 



